AlpinDale | 1da305b94f | wip | 3 months ago
AlpinDale | a604ab69c4 | fix: kobold api for horde (#763) | 3 months ago
AlpinDale | 0e0bd02b52 | ci: bump version to 0.6.2 (#758) | 3 months ago
AlpinDale | 5878e887f2 | docs: update readme and docs (#757) | 3 months ago
AlpinDale | d7309453f6 | fix: add pandas to requirements (#756) | 3 months ago
AlpinDale | 73177656ed | feat: quant_llm support (#755) | 3 months ago
AlpinDale | ad181e3fef | feat: bring back dynatemp (#754) | 3 months ago
AlpinDale | 6329c2d53f | chore: re-enable custom token bans (#751) | 3 months ago
Ahmed | 55261b09d6 | ci: fix docs deployment (#750) | 3 months ago
Ahmed | aecd80a47b | Merge pull request #749 from PygmalionAI/ci/fix-pnpm-install | 3 months ago
Ahmed | 4435a443e1 | ci: fix dep install using pnpm | 3 months ago
AlpinDale | abd9d5799a | feat: add XTC Sampling (#740) | 3 months ago
AlpinDale | 4434c4db84 | chore: refactor llama3 rope (#748) | 3 months ago
AlpinDale | 9d9722b1c1 | fix: metrics endpoint with RPC server (#747) | 3 months ago
AlpinDale | 81c5f196eb | chore: various TPU fixes and optimizations (#746) | 3 months ago
AlpinDale | 89a2c6dee1 | chore: refactor `MultiModalConfig` initialization and profiling (#745) | 3 months ago
AlpinDale | 1068597e8a | fix: minor bug fixes & clean-ups (#744) | 3 months ago
Geun, Lim | 08711d2ac9 | feat: add Exaone model support (#743) | 3 months ago
AlpinDale | 81c28d2a7f | fix: use nvml to get consistent device names (#739) | 3 months ago
AlpinDale | 5559c5886f | fix: clear engine ref in RPC server (#738) | 3 months ago
AlpinDale | ef3a0f4cb1 | fix: `custom_ar` check (#737) | 3 months ago
AlpinDale | ccbda97416 | fix: types in AQLM and GGUF for dynamo support (#736) | 3 months ago
AlpinDale | 9296d4b25d | feat: dynamo support for ScalarType (#733) | 3 months ago
AlpinDale | d9d85eeb6e | chore: register lora functions as torch ops (#732) | 3 months ago
AlpinDale | 7a313483f1 | chore: move update_flash_attn_metadata to attn backend (#731) | 3 months ago
AlpinDale | d34e083c48 | feat: add experts_int8 support (#730) | 3 months ago
AlpinDale | b0f262eec1 | feat: FP8 quantization support for AMD ROCm (#729) | 3 months ago
AlpinDale | c744443679 | ci: bump to 0.6.1.post1 (#728) | 3 months ago
miku448 | 9c0e7d95c8 | fix: libcudart path for some versions of pytorch (#726) | 3 months ago
AlpinDale | 4648f16c84 | chore: fix return statement in Detokenizer class (#727) | 3 months ago