Commit History

Author SHA1 Message Date
  AlpinDale 9181fa0396 feat: Triton kernels for sampling (#383) 9 months ago
  AlpinDale e3252edd07 fix: remove event and stream, add typing (#382) 9 months ago
  AlpinDale 375f24ccca fix: optimize context shift performance (#380) 9 months ago
  AlpinDale 33b3786175 fix: cache neuron checks (#379) 9 months ago
  AlpinDale 8c9cabf4c8 fix: display error in ray before deadlock (#378) 9 months ago
  50h100a dc09dc2b4d Merge branch 'main' into pr_samplers 9 months ago
  AlpinDale f587953f46 fix: yapf 9 months ago
  AlpinDale 4b99ac15b7 fix: do not deepcopy metadata 9 months ago
  AlpinDale 17b034613d chore: make metadata a dataclass (#377) 9 months ago
  AlpinDale 9534fcfb7b fix: build error 9 months ago
  AlpinDale 0b35176089 feat: add context-free grammars (#376) 9 months ago
  AlpinDale feb5840f2a feat: async tokenization (#374) 9 months ago
  IggoOnCode 2aec297c55 feat: add embeddings endpoint to openai rest-api server. (#363) 9 months ago
  AlpinDale 29c241c115 fix: explicitly disallow installation on non-linux platforms (#373) 9 months ago
  AlpinDale 439a826712 fix: broadcast group 9 months ago
  AlpinDale 935027bdcc feat: dynamic shared memory allocation for moe align block size (#372) 9 months ago
  AlpinDale 97a2b26c97 fix: assertion error when use_sliding_window is present 9 months ago
  AlpinDale e702f587cf feat: add batched RoPE kernels (#371) 9 months ago
  AlpinDale 3d6695cfbb feat: add approximate gelu activation kernels (#370) 9 months ago
  AlpinDale 5fa15b4435 fix: double free with sliding window (#369) 9 months ago
  AlpinDale 72cd8494aa feat: mistral neuron support (#368) 9 months ago
  AlpinDale 0f6d56b07f feat: model executor refactor (#367) 9 months ago
  AlpinDale b361096463 fix: tokenizer when using ray (#366) 9 months ago
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 months ago
  50h100a 35b4aa7da5 Fix logitproc for logit_bias in OAI endpoints. 9 months ago
  50h100a 7ed57e318d Overhauled SamplingTensors construction. 9 months ago
  50h100a a39920bc99 Merge pull request #355 from 50h100a/pr_seedfix 9 months ago
  50h100a 051c60736e Merge pull request #356 from 50h100a/pr_samplerinternals 9 months ago
  50h100a d5dbd29db4 hoist sampler internals into a single function. 9 months ago
  50h100a b9e0ae87c5 fix fine-grained seeding. 9 months ago