david/aphrodite-engine: PygmalionAI's large-scale inference engine pygmalion.chat It is designed to serve as the inference endpoint for the PygmalionAI website, and to allow serving the Pygmalion models to a large number of users with blazing fast speeds (thanks to vLLM's Paged Attention). @ 5240c0da23c3ac7df244377a6613b38c269cdcf5

AlpinDale cda0e93a10 abstract away the platform for device capability		7 місяців тому
..
compressed_tensors	cda0e93a10 abstract away the platform for device capability	7 місяців тому
gguf_utils	9d81716bfd [v0.5.3] Release Candidate (#388)	10 місяців тому
__init__.py	517676249c chore: update the compressed-tensors config	7 місяців тому
aqlm.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 місяців тому
autoquant.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 місяців тому
awq.py	17f7089e26 fix: `get_min_capability` for all quants	7 місяців тому
base_config.py	0f4a9ee77b quantized lm_head (#582)	7 місяців тому
bitsandbytes.py	17f7089e26 fix: `get_min_capability` for all quants	7 місяців тому
deepspeedfp.py	4acf34417a feat: add DeepSpeedFP quantization for all models	8 місяців тому
eetq.py	17f7089e26 fix: `get_min_capability` for all quants	7 місяців тому
exl2.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 місяців тому
fp8.py	cda0e93a10 abstract away the platform for device capability	7 місяців тому
gguf.py	17f7089e26 fix: `get_min_capability` for all quants	7 місяців тому
gptq.py	0f4a9ee77b quantized lm_head (#582)	7 місяців тому
gptq_marlin.py	cda0e93a10 abstract away the platform for device capability	7 місяців тому
gptq_marlin_24.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 місяців тому
hadamard.safetensors	9d81716bfd [v0.5.3] Release Candidate (#388)	10 місяців тому
marlin.py	0f4a9ee77b quantized lm_head (#582)	7 місяців тому
quip.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 місяців тому
quip_utils.py	9d81716bfd [v0.5.3] Release Candidate (#388)	10 місяців тому
schema.py	9d81716bfd [v0.5.3] Release Candidate (#388)	10 місяців тому
squeezellm.py	17f7089e26 fix: `get_min_capability` for all quants	7 місяців тому