.. |
async_aphrodite
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
basic_correctness
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 month ago |
benchmarks
|
b5aa11020b
api: fix crashes under very high loads (#878)
|
4 weeks ago |
compile
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
core
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 month ago |
distributed
|
2f61644f6e
SPMD optimizations (#824)
|
1 month ago |
endpoints
|
ce6e3d63f7
api: better startup failure UX (#881)
|
4 weeks ago |
engine
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
kernels
|
93bc863591
feat: Machete Kernels for Hopper GPUs (#842)
|
1 month ago |
lora
|
68f050129d
fix: lora worker manager test import
|
1 month ago |
metrics
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
modeling
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
models
|
e182d00256
feat: AWQ quantization for InternVL (#867)
|
1 month ago |
multi_step
|
48a8693aed
feat: multi-step scheduling (#831)
|
1 month ago |
multimodal
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
plugins
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
prefix_caching
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
prompt_adapter
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
prompts
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
quantization
|
7f1c9af5e2
fix: fp8 quant test
|
1 month ago |
samplers
|
2150bb5019
sampler: add range parameter for DRY (#855)
|
1 month ago |
spec_decode
|
16b587c104
fix: hidden states handling in batch expansion for spec decoding (#839)
|
1 month ago |
tensorizer_loader
|
32a37e8107
tests: partially fix tensorizer and logprobs tests
|
1 month ago |
tokenization
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
weight_loading
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
worker
|
48a8693aed
feat: multi-step scheduling (#831)
|
1 month ago |
__init__.py
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
conftest.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_cache_block_hashing.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_config.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_embedded_commit.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_inputs.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_logits_processor.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_regression.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_sampling_params.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_scalartype.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_sequence.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_sharded_state_loader.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
test_utils.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
utils.py
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |