AlpinDale a1f18f17e6 modify the cache engine and model runner/worker to support mamba states 9 months ago
..
__init__.py 04b53d2db5 chore: add initializer files 1 year ago
cache_engine.py a1f18f17e6 modify the cache engine and model runner/worker to support mamba states 9 months ago
cpu_model_runner.py 6e0761ba5d make init_distributed_environment compatible with init_process_group 9 months ago
cpu_worker.py 083ba7b452 roll back chunked prefill changes to SDPA, isolate cpu worker 9 months ago
model_runner.py a1f18f17e6 modify the cache engine and model runner/worker to support mamba states 9 months ago
neuron_model_runner.py 0f1399c135 feat: attention refactor part 2 10 months ago
neuron_worker.py 4d33ce60da feat: Triton flash attention backend for ROCm (#407) 9 months ago
worker.py a1f18f17e6 modify the cache engine and model runner/worker to support mamba states 9 months ago
worker_base.py 8c67b37131 fix docstrings 9 months ago