AlpinDale | e177788401 | add moe tests | 8 months ago
AlpinDale | 9da2946652 | update kernel tests | 8 months ago
AlpinDale | 2172a9c374 | add fp8_e4m3fn scales for llama2 7b and 70b | 8 months ago
AlpinDale | e28c8496b2 | endpoint tests | 8 months ago
AlpinDale | ce10891496 | add more engine-related tests: | 8 months ago
AlpinDale | bfca95c1a8 | update detokenization test | 8 months ago
AlpinDale | 9e1cea354c | add distributed system tests | 8 months ago
AlpinDale | 54ed7adef3 | add core processor tests | 8 months ago
AlpinDale | 3a206f9e11 | add chunked prefill correctness test | 8 months ago
AlpinDale | 0558b22749 | basic correctness test | 8 months ago
AlpinDale | 9082ac7b7a | add async engine test units | 8 months ago
sgsdxzy | fcfb72af24 | Support arbitrary model in GGUF. (#381) | 8 months ago
AlpinDale | bd0ddf1cfe | feat: EETQ quantization (#408) | 8 months ago
AlpinDale | b1caee23a6 | cache the p2p access check for memory saving | 8 months ago
AlpinDale | 373e0d3c01 | fix neuron | 8 months ago
AlpinDale | 28bcca2396 | incorrect use of monotonic time in metrics logger | 8 months ago
AlpinDale | 4ba273886a | debug logging for distributed_init_method | 8 months ago
AlpinDale | 1270b5567e | triton compile error for flash_attn | 8 months ago
AlpinDale | f375353026 | enable custom_all_reduce by default in llm.py | 8 months ago
AlpinDale | 2d2b43fe00 | fix type hint | 8 months ago
AlpinDale | 531969a0b2 | move merge_async_iterators to common utils | 8 months ago
AlpinDale | c18bf116da | fix stop strings not being excluded from outputs | 8 months ago
AlpinDale | 5ab7a159d7 | fix formatting for previous commit | 8 months ago
AlpinDale | b6bbf584ac | fix echo | 8 months ago
AlpinDale | 6e0761ba5d | make init_distributed_environment compatible with init_process_group | 8 months ago
AlpinDale | 083ba7b452 | roll back chunked prefill changes to SDPA, isolate cpu worker | 8 months ago
AlpinDale | 8c67b37131 | fix docstrings | 8 months ago
AlpinDale | fe17712f29 | fully working chunked prefill | 8 months ago
AlpinDale | 8db2fa8e2e | why was that not committed? | 8 months ago
AlpinDale | 54678c91f3 | fix outlines requirements | 8 months ago