AlpinDale ef99a567b6 fix: temp_last warning being repeated for every output token (#869) há 1 mês atrás
..
guided_decoding 0256ed236b feat: windows support (#790) há 2 meses atrás
layers ef99a567b6 fix: temp_last warning being repeated for every output token (#869) há 1 mês atrás
model_loader e182d00256 feat: AWQ quantization for InternVL (#867) há 1 mês atrás
models e182d00256 feat: AWQ quantization for InternVL (#867) há 1 mês atrás
__init__.py 7df7b8ca53 optimization: reduce end-to-end overhead from python obj allocation (#666) há 4 meses atrás
_custom_op.py 5d37ec1016 suppress tpu import warning (#696) há 4 meses atrás
parameter.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) há 1 mês atrás
pooling_metadata.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
sampling_metadata.py 2150bb5019 sampler: add range parameter for DRY (#855) há 1 mês atrás
utils.py 9d81716bfd [v0.5.3] Release Candidate (#388) há 8 meses atrás