Commit History

Author SHA1 Message Date
  AlpinDale 398a97338a feat: enable lora loading/unloading via API (#337) 10 months ago
  Stefan Daniel Schwarz b0688b6b9c fix: docker port and kobold api (#338) 10 months ago
  AlpinDale ed225f59cb fix: transformers in requirements 11 months ago
  AlpinDale e120404436 Revert "feat: CMake Build System Generator (#332)" 11 months ago
  AlpinDale 06312251a7 fix: explictly export CUDA arches for CI 11 months ago
  AlpinDale e53842bd5d fix: cuda home detection for fp8 kv cache 11 months ago
  AlpinDale 7411a74cc6 bump version to 0.5.2 11 months ago
  AlpinDale ad6802690f feat: CMake Build System Generator (#332) 11 months ago
  AlpinDale da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 11 months ago
  AlpinDale e2a7b50440 fix: logprobs when inf or nan (#329) 11 months ago
  AlpinDale 4791a63fdc fix: env.py url in bugs template 11 months ago
  AlpinDale 8071ead964 chore: allow docker port and host to be changed (#327) 11 months ago
  AlpinDale 594fe814dc bump version to v0.5.1 (#326) 11 months ago
  AlpinDale f8652c8e99 fix: optimize aqlm dequantization (#325) 11 months ago
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) 11 months ago
  AlpinDale 637649df99 fix: model -> model architecture in issue templates 11 months ago
  AlpinDale 31092ad5ae fix: issues template 11 months ago
  AlpinDale e544814a92 feat: add issue template and an env info collector (#321) 11 months ago
  AlpinDale 89c32b40ec chore: add new imatrix quants (#320) 11 months ago
  sgsdxzy 50c0875c32 chore: log total memory usage (#316) 11 months ago
  AlpinDale e82b654ddd readme: add tabby, fix docker, add colab (#315) 11 months ago
  AlpinDale fa07e6db61 docker: build docker for all CUDA arches 11 months ago
  drummerv e59dd4a90d fix: openai gguf chat template (#312) 11 months ago
  AlpinDale b3df2351c8 readme: update with bsz1 graph 11 months ago
  AlpinDale 434dc19961 CI: fix build failure for cuda versions with no torch wheels 11 months ago
  AlpinDale 968bde81bf fix: tensor parallel with GPTQ and AWQ quants (#307) 11 months ago
  AlpinDale ff898c2c80 bump version to 0.5.0 (#303) 11 months ago
  AlpinDale c41462cfcd feat: exllamav2 quantization (#305) 11 months ago
  AlpinDale 3a045ebfde fix: escape tags in loguru (#304) 11 months ago
  AlpinDale 9ec611090d chore: build for more cuda versions 11 months ago