.. |
adapter_commons
|
99680b2d23
feat: soft prompts (#589)
|
пре 6 месеци |
attention
|
a2d476183f
fix: remove scipy and re-implement CSR matrix
|
пре 6 месеци |
common
|
ddb28a80a3
fix: bump torch for rocm, unify CUDA_VISIBLE_DEVICES for cuda and rocm
|
пре 6 месеци |
distributed
|
cc6399792f
fix: keep consistent with how pytorch finds libcudart.so
|
пре 6 месеци |
endpoints
|
a3b56353fa
fix: another one missed
|
пре 6 месеци |
engine
|
63becc67c0
fix: prompt logprob detokenization
|
пре 6 месеци |
executor
|
4501ae5f15
fix: neuron executor for adapters
|
пре 6 месеци |
inputs
|
4f7d212b70
feat: remove vision language config
|
пре 6 месеци |
kv_quant
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
пре 1 година |
lora
|
99680b2d23
feat: soft prompts (#589)
|
пре 6 месеци |
modeling
|
d2f38f6f81
chore: remove separate bias add
|
пре 6 месеци |
multimodal
|
c11a8bdaad
fix: calculate max number of multi-modal tokens automatically
|
пре 6 месеци |
platforms
|
1a40bf438b
fix: incorrect gpu capability when used mixed gpus
|
пре 6 месеци |
processing
|
99680b2d23
feat: soft prompts (#589)
|
пре 6 месеци |
prompt_adapter
|
99680b2d23
feat: soft prompts (#589)
|
пре 6 месеци |
quantization
|
500f3b654f
fix: support bias term in compressed-tensors quant
|
пре 6 месеци |
spec_decode
|
16dff9babc
chore: enable bonus token in spec decoding for KV cache based models
|
пре 6 месеци |
task_handler
|
ddb28a80a3
fix: bump torch for rocm, unify CUDA_VISIBLE_DEVICES for cuda and rocm
|
пре 6 месеци |
transformers_utils
|
63becc67c0
fix: prompt logprob detokenization
|
пре 6 месеци |
__init__.py
|
a07fc83bc8
chore: proper util for aphrodite version
|
пре 7 месеци |
_custom_ops.py
|
ad24e74a99
feat: FP8 weight-only quantization support for Ampere GPUs
|
пре 6 месеци |
_ipex_ops.py
|
6a57861fca
feat: initial XPU support via intel_extension_for_pytorch (#571)
|
пре 7 месеци |
py.typed
|
1c988a48b2
fix logging and add py.typed
|
пре 1 година |
version.py
|
7e54c3916d
chore: factor out epilogues from cutlass kernels
|
пре 7 месеци |