Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| 50h100a | d3dd170a7d | merge main | 9 months ago |
| AlpinDale | 78d66f16d1 | Chunked Prefill Part 1 (#384) | 9 months ago |
| AlpinDale | 9181fa0396 | feat: Triton kernels for sampling (#383) | 9 months ago |
| AlpinDale | e3252edd07 | fix: remove event and stream, add typing (#382) | 10 months ago |
| AlpinDale | 375f24ccca | fix: optimize context shift performance (#380) | 10 months ago |
| AlpinDale | 33b3786175 | fix: cache neuron checks (#379) | 10 months ago |
| AlpinDale | 8c9cabf4c8 | fix: display error in ray before deadlock (#378) | 10 months ago |
| 50h100a | dc09dc2b4d | Merge branch 'main' into pr_samplers | 10 months ago |
| AlpinDale | f587953f46 | fix: yapf | 10 months ago |
| AlpinDale | 4b99ac15b7 | fix: do not deepcopy metadata | 10 months ago |
| AlpinDale | 17b034613d | chore: make metadata a dataclass (#377) | 10 months ago |
| AlpinDale | 9534fcfb7b | fix: build error | 10 months ago |
| AlpinDale | 0b35176089 | feat: add context-free grammars (#376) | 10 months ago |
| AlpinDale | feb5840f2a | feat: async tokenization (#374) | 10 months ago |
| IggoOnCode | 2aec297c55 | feat: add embeddings endpoint to openai rest-api server. (#363) | 10 months ago |
| AlpinDale | 29c241c115 | fix: explicitly disallow installation on non-linux platforms (#373) | 10 months ago |
| AlpinDale | 439a826712 | fix: broadcast group | 10 months ago |
| AlpinDale | 935027bdcc | feat: dynamic shared memory allocation for moe align block size (#372) | 10 months ago |
| AlpinDale | 97a2b26c97 | fix: assertion error when use_sliding_window is present | 10 months ago |
| AlpinDale | e702f587cf | feat: add batched RoPE kernels (#371) | 10 months ago |
| AlpinDale | 3d6695cfbb | feat: add approximate gelu activation kernels (#370) | 10 months ago |
| AlpinDale | 5fa15b4435 | fix: double free with sliding window (#369) | 10 months ago |
| AlpinDale | 72cd8494aa | feat: mistral neuron support (#368) | 10 months ago |
| AlpinDale | 0f6d56b07f | feat: model executor refactor (#367) | 10 months ago |
| AlpinDale | b361096463 | fix: tokenizer when using ray (#366) | 10 months ago |
| AlpinDale | f8dfac6372 | chore: attention refactor and upstream sync apr01 (#365) | 10 months ago |
| 50h100a | 35b4aa7da5 | Fix logitproc for logit_bias in OAI endpoints. | 10 months ago |
| 50h100a | 7ed57e318d | Overhauled SamplingTensors construction. | 10 months ago |
| 50h100a | a39920bc99 | Merge pull request #355 from 50h100a/pr_seedfix | 10 months ago |
| 50h100a | 051c60736e | Merge pull request #356 from 50h100a/pr_samplerinternals | 10 months ago |