Commit History

Autor SHA1 Mensaxe Data
  AlpinDale 656459fd84 make fp8_e4m3 work on nvidia hai 6 meses
  AlpinDale 50b7c13db0 refactor: attention selector (#552) hai 6 meses
  AlpinDale 9fba7f1d36 remove quant_config from a few legacy models hai 6 meses
  AlpinDale b178ae4b4a chore: generalize linear_method to be quant_method (#540) hai 6 meses
  AlpinDale fca911ee0a vLLM Upstream Sync (#526) hai 7 meses
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) hai 8 meses
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) hai 10 meses
  AlpinDale da223153c6 feat&fix: cohere support and missing GPU blocks (#333) hai 10 meses
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) hai 10 meses
  AlpinDale c41462cfcd feat: exllamav2 quantization (#305) hai 11 meses
  AlpinDale 2370dbcfd8 feat: OPT model support (#266) hai 11 meses