AlpinDale 4d4e767838 ci: take one of fixing lint issues 4 months ago
..
__init__.py 04b53d2db5 chore: add initializer files 1 year ago
cache_engine.py 5289c14b24 feat: Asymmetric Tensor Parallel (#594) 4 months ago
cpu_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs 4 months ago
cpu_worker.py 42c66d5b00 feat: tensor parallelism for CPU backend 4 months ago
embedding_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs 4 months ago
model_runner.py 4d4e767838 ci: take one of fixing lint issues 4 months ago
model_runner_base.py d8a51d05a7 fix: seeded gens with pipeline parallel 4 months ago
neuron_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs 4 months ago
neuron_worker.py ae04f57ec1 feat: Pipeline Parallel support (#581) 4 months ago
openvino_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs 4 months ago
openvino_worker.py 1ff6d4c3d7 feat: support pipeline parallel on indivisible GPU count (#587) 4 months ago
tpu_model_runner.py eef647deab fix: greedy decoding in TPU 4 months ago
tpu_worker.py 269e9aabda fix: set readonly=True for non-root TPU devices 4 months ago
worker.py 6979ff658e chore: perform allreduce in fp32 for marlin, better logging 4 months ago
worker_base.py 523ac99aca chore: pipeline parallel with Ray accelerated dag 4 months ago
xpu_model_runner.py 705e50f4bd fix: broadcasting logic for multi_modal_kwargs 4 months ago
xpu_worker.py 99680b2d23 feat: soft prompts (#589) 4 months ago