Commit History

Author SHA1 Message Date
  AlpinDale 72cd8494aa feat: mistral neuron support (#368) 10 months ago
  AlpinDale 0f6d56b07f feat: model executor refactor (#367) 10 months ago
  AlpinDale b361096463 fix: tokenizer when using ray (#366) 10 months ago
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 10 months ago
  50h100a a39920bc99 Merge pull request #355 from 50h100a/pr_seedfix 10 months ago
  50h100a 051c60736e Merge pull request #356 from 50h100a/pr_samplerinternals 10 months ago
  50h100a d5dbd29db4 hoist sampler internals into a single function. 10 months ago
  50h100a b9e0ae87c5 fix fine-grained seeding. 10 months ago
  sgsdxzy 6ebac34dc1 chore: cleaner pre-llamafied Yi implementation (#352) 10 months ago
  AlpinDale 681e94611f fix: restore backwards compatibility with old Yi models (#351) 10 months ago
  AlpinDale 1b6732fcde chore: bump transformers version 10 months ago
  Absurd 070c1cef8c fix: explicit RFC3986 for prometheus_client asgi (#344) 10 months ago
  Stefan Daniel Schwarz 5d747cfc4d readme: docker docs (#340) 10 months ago
  Stefan Daniel Schwarz 8e259ee7cf chore: hf_transfer for faster downloads (#339) 11 months ago
  AlpinDale 398a97338a feat: enable lora loading/unloading via API (#337) 11 months ago
  Stefan Daniel Schwarz b0688b6b9c fix: docker port and kobold api (#338) 11 months ago
  AlpinDale ed225f59cb fix: transformers in requirements 11 months ago
  AlpinDale e120404436 Revert "feat: CMake Build System Generator (#332)" 11 months ago
  AlpinDale 06312251a7 fix: explictly export CUDA arches for CI 11 months ago
  AlpinDale e53842bd5d fix: cuda home detection for fp8 kv cache 11 months ago
  AlpinDale 7411a74cc6 bump version to 0.5.2 11 months ago
  AlpinDale ad6802690f feat: CMake Build System Generator (#332) 11 months ago
  AlpinDale da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 11 months ago
  AlpinDale e2a7b50440 fix: logprobs when inf or nan (#329) 11 months ago
  AlpinDale 4791a63fdc fix: env.py url in bugs template 11 months ago
  AlpinDale 8071ead964 chore: allow docker port and host to be changed (#327) 11 months ago
  AlpinDale 594fe814dc bump version to v0.5.1 (#326) 11 months ago
  AlpinDale f8652c8e99 fix: optimize aqlm dequantization (#325) 11 months ago
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) 11 months ago
  AlpinDale 637649df99 fix: model -> model architecture in issue templates 11 months ago