.. |
async_aphrodite
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
basic_correctness
|
f7f3fed265
feat: add async postprocessor (#925)
|
2 weeks ago |
benchmarks
|
f7f3fed265
feat: add async postprocessor (#925)
|
2 weeks ago |
compile
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
core
|
f7f3fed265
feat: add async postprocessor (#925)
|
2 weeks ago |
distributed
|
d69273bd2b
ray: better error when placement group topology is incorrect (#906)
|
3 weeks ago |
endpoints
|
0c162c8dad
api: use fp32 for base64 embeddings (#919)
|
3 weeks ago |
engine
|
f7f3fed265
feat: add async postprocessor (#925)
|
2 weeks ago |
kernels
|
93bc863591
feat: Machete Kernels for Hopper GPUs (#842)
|
1 month ago |
lora
|
68f050129d
fix: lora worker manager test import
|
1 month ago |
metrics
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
modeling
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
models
|
fce970a846
feat: multi-image input support for Phi3V (#917)
|
3 weeks ago |
multi_step
|
f7f3fed265
feat: add async postprocessor (#925)
|
2 weeks ago |
multimodal
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
plugins
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
prefix_caching
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
prompt_adapter
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
prompts
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
quantization
|
7f1c9af5e2
fix: fp8 quant test
|
1 month ago |
samplers
|
2150bb5019
sampler: add range parameter for DRY (#855)
|
1 month ago |
spec_decode
|
3b684a8a54
spec decode: streamline batch expansion tensor manipulation (#918)
|
3 weeks ago |
tensorizer_loader
|
673621a3d2
xpu: refactor the model runner for tensor parallelism (#910)
|
3 weeks ago |
tokenization
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
tpu
|
b0a8169b54
core: do not compile for profiling
|
2 weeks ago |
weight_loading
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
worker
|
22a4cd4595
core: fix spec decode metrics and envs circular import (#889)
|
3 weeks ago |
__init__.py
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
conftest.py
|
653d1a08d4
feat: add support for audio models (#891)
|
3 weeks ago |
test_cache_block_hashing.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_config.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_embedded_commit.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_inputs.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_logits_processor.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_regression.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_sampling_params.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_scalartype.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_sequence.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_sharded_state_loader.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_utils.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
utils.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |