david/aphrodite-engine @ llama3-rope

PygmalionAI's large-scale inference engine (pygmalion.chat). It is designed to serve as the inference endpoint for the PygmalionAI website and to serve the Pygmalion models to a large number of users at high speed (thanks to vLLM's Paged Attention).

AlpinDale ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736)		3 months ago
all_reduce	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
attention	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
backup	f8dfac6372 chore: attention refactor and upstream sync apr01 (#365)	9 months ago
core	9296d4b25d feat: dynamo support for ScalarType (#733)	3 months ago
cpu	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
hadamard	5d288aa76c feat: add fast hadamard transformation kernels (#232)	11 months ago
mamba	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
moe	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
prepare_inputs	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
punica	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
quantization	ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736)	3 months ago
activation_kernels.cu	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
cache.h	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
cache_kernels.cu	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
cuda_compat.h	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
cuda_utils.h	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
cuda_utils_kernels.cu	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
dispatch_utils.h	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
layernorm_kernels.cu	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
ops.h	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
pos_encoding_kernels.cu	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
reduction.cuh	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
torch_bindings.cpp	a401f8e05d feat: per-tensor token epilogue kernels (#630)	4 months ago