Commit History

Author SHA1 Message Date
  AlpinDale c1c37c755d bump version to 0.6.0 5 months ago
  AlpinDale 75122b20ef chore: refactor wheel build script 5 months ago
  AlpinDale 638784c3c9 docs: fix typos 5 months ago
  AlpinDale cc7a636ffd ci: add action for deploying docs 5 months ago
  AlpinDale 57968a5053 docs: finalize User & Developer Documentation for Release Candidate (#618) 5 months ago
  AlpinDale 208cd5405f fix: cpu offloading with gptq 5 months ago
  AlpinDale 28946766fb fix: allow loading GGUF model without .gguf extension 5 months ago
  AlpinDale 2b85ffb1a5 chore: minor cleanups 5 months ago
  AlpinDale 2424207fac ci: remove isort 5 months ago
  AlpinDale c4933b1a6d ci: remove yapf from the formatting script 5 months ago
  AlpinDale 616de67ff5 ci: remove yapf 5 months ago
  AlpinDale f18eeaf59a ci: codespell fixes 5 months ago
  AlpinDale 4d4e767838 ci: take one of fixing lint issues 5 months ago
  AlpinDale 6b1f96586b ci: remove clang-format 5 months ago
  AlpinDale dc00aa7b17 ci: a few more ignores 5 months ago
  AlpinDale e63be8e46c minor CI fixes 5 months ago
  AlpinDale 0e6c400b13 feat: re-add GGUF (#600) 5 months ago
  AlpinDale 6c1eab6a6c feat: non-blocking transfer in prepare_input 5 months ago
  AlpinDale 2a349ca3e1 fix: specify device when loading lora and embedding tensors 5 months ago
  AlpinDale 9d98f29b3a chore: update cutlass to 3.5.1 5 months ago
  AlpinDale bd210a6cf6 fix: use args.trust_remote_code 5 months ago
  AlpinDale e8008f24ed fix: use ipv4 localhost form for zmq bind 5 months ago
  AlpinDale 6c2e24de53 fix: support flashinfer for draft model runner 5 months ago
  AlpinDale edffcecc67 chore: add proper logging for spec decoding verification 5 months ago
  AlpinDale c3ee71a437 feat: port SiglipVisionModel from transformers 5 months ago
  AlpinDale 040e5af52b refactor: factor out code for running uvicorn again 5 months ago
  AlpinDale 9a50e3b4eb refactor: minicpmv and port Idefix2VisionTransformer 5 months ago
  AlpinDale 29f0478f90 chore: simplify output processing with shortcut for non-parallel sampling and non-beam search usecase (#616) 5 months ago
  AlpinDale b6c97e4d16 feat: add guided decoding to LLM 5 months ago
  AlpinDale 212b9d8a03 refactor: add has_prefix_cache_hit flag to FlashAttentionMetadataBuilder 5 months ago