Histórico de Commits

Autor SHA1 Mensagem Data
  AlpinDale bf15e1b4e8 chore: deprecation warning for beam search há 6 meses atrás
  AlpinDale 7e9d4f3c71 chore: some more marlin cleanups há 6 meses atrás
  AlpinDale 34fc26c869 chore: bump lmfe version to 0.10.3 há 6 meses atrás
  AlpinDale 23408b9b2b chore: skip the driver worker há 6 meses atrás
  AlpinDale d8f9f0ec16 fix: prefix prefill kernels for fp32 data type há 6 meses atrás
  AlpinDale 92bcbdf975 fix: megacore setting for TPU v5e-litepod há 6 meses atrás
  AlpinDale 0c17c2a8a7 chore: add commit hash, clean up engine logs há 6 meses atrás
  AlpinDale cdc0e498a9 fix: illegal memory access in FP8 MoE kernel há 6 meses atrás
  AlpinDale b1e61268a8 bump torch to 2.3.1 há 6 meses atrás
  AlpinDale 05e45aeb53 fix: dtype mismatch for paligemma há 6 meses atrás
  AlpinDale 500f3b654f fix: support bias term in compressed-tensors quant há 6 meses atrás
  AlpinDale d2f38f6f81 chore: remove separate bias add há 6 meses atrás
  AlpinDale ddb28a80a3 fix: bump torch for rocm, unify CUDA_VISIBLE_DEVICES for cuda and rocm há 6 meses atrás
  AlpinDale a2d476183f fix: remove scipy and re-implement CSR matrix há 6 meses atrás
  AlpinDale 5ac65d2d49 chore: bump optimum-intel há 6 meses atrás
  AlpinDale cc6399792f fix: keep consistent with how pytorch finds libcudart.so há 6 meses atrás
  AlpinDale 63becc67c0 fix: prompt logprob detokenization há 6 meses atrás
  AlpinDale 0ab35652d3 fix: llava 1.6 feature size calculation há 6 meses atrás
  AlpinDale 058e629f8e chore: refactor marlin python utils há 6 meses atrás
  AlpinDale c0c2b1ac20 fix: get_and_reset only when scheduler outputs are not empty há 6 meses atrás
  AlpinDale b9268be8e8 fix: engine timeout due to request abort há 6 meses atrás
  AlpinDale 8a44866e00 restrict outlines to < 0.1 há 6 meses atrás
  AlpinDale 4501ae5f15 fix: neuron executor for adapters há 6 meses atrás
  AlpinDale 16dff9babc chore: enable bonus token in spec decoding for KV cache based models há 6 meses atrás
  AlpinDale 4150b1ea3a fix: adapter methods for OpenVINO executor há 6 meses atrás
  AlpinDale db73f03cdc fix: use ParallelLMHead for MLPSpeculator há 6 meses atrás
  AlpinDale 9622c59f8f chore: support 2D input shape in MoE layer há 6 meses atrás
  AlpinDale 4628caeae6 fix: missed these adapter methods for TPU executor há 6 meses atrás
  AlpinDale dba22e4f83 fix: add zeromq fallback for broadcasting large objects (e.g. vlm images) há 6 meses atrás
  AlpinDale d9f4c36edd feat: Medusa speculative decoding support (#590) há 6 meses atrás