AlpinDale 4d4e767838 ci: take one of fixing lint issues há 4 meses atrás
..
__init__.py 04b53d2db5 chore: add initializer files há 1 ano atrás
cache_engine.py 5289c14b24 feat: Asymmetric Tensor Parallel (#594) há 5 meses atrás
cpu_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs há 4 meses atrás
cpu_worker.py 42c66d5b00 feat: tensor parallelism for CPU backend há 4 meses atrás
embedding_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs há 4 meses atrás
model_runner.py 4d4e767838 ci: take one of fixing lint issues há 4 meses atrás
model_runner_base.py d8a51d05a7 fix: seeded gens with pipeline parallel há 4 meses atrás
neuron_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs há 4 meses atrás
neuron_worker.py ae04f57ec1 feat: Pipeline Parallel support (#581) há 5 meses atrás
openvino_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs há 4 meses atrás
openvino_worker.py 1ff6d4c3d7 feat: support pipeline parallel on indivisible GPU count (#587) há 5 meses atrás
tpu_model_runner.py eef647deab fix: greedy decoding in TPU há 4 meses atrás
tpu_worker.py 269e9aabda fix: set readonly=True for non-root TPU devices há 4 meses atrás
worker.py 6979ff658e chore: perform allreduce in fp32 for marlin, better logging há 4 meses atrás
worker_base.py 523ac99aca chore: pipeline parallel with Ray accelerated dag há 4 meses atrás
xpu_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs há 4 meses atrás
xpu_worker.py 99680b2d23 feat: soft prompts (#589) há 5 meses atrás