david/aphrodite-engine: PygmalionAI's large-scale inference engine (pygmalion.chat). It is designed to serve as the inference endpoint for the PygmalionAI website, and to allow serving the Pygmalion models to a large number of users with blazing fast speeds (thanks to vLLM's Paged Attention). @ ee174ea4fd3bda4f206c74e0c3f53d978fb10475

AlpinDale a33aaf3b42 chore: cleanup compressed tensors		7 months ago
..
compressed_tensors	a33aaf3b42 chore: cleanup compressed tensors	7 months ago
gguf_utils	9d81716bfd [v0.5.3] Release Candidate (#388)	10 months ago
__init__.py	517676249c chore: update the compressed-tensors config	7 months ago
aqlm.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
autoquant.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
awq.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
base_config.py	c66b1b57b1 Marlin 2:4 sparsity (#555)	7 months ago
bitsandbytes.py	690110a051 feat: bitsandbytes quantization	7 months ago
deepspeedfp.py	4acf34417a feat: add DeepSpeedFP quantization for all models	7 months ago
eetq.py	b178ae4b4a chore: generalize linear_method to be quant_method (#540)	8 months ago
exl2.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
fp8.py	7e54c3916d chore: factor out epilogues from cutlass kernels	7 months ago
gguf.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
gptq.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
gptq_marlin.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
gptq_marlin_24.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
hadamard.safetensors	9d81716bfd [v0.5.3] Release Candidate (#388)	10 months ago
marlin.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
quip.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago
quip_utils.py	9d81716bfd [v0.5.3] Release Candidate (#388)	10 months ago
schema.py	9d81716bfd [v0.5.3] Release Candidate (#388)	10 months ago
squeezellm.py	156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)	7 months ago