AlpinDale 39beed0b87 Revert "Refactor AWQ support." 1 éve
..
tensor_parallel 39beed0b87 Revert "Refactor AWQ support." 1 éve
README.md 386be46787 fix: megatron-lm url 1 éve
__init__.py 76b2e4a445 Merge dev branch into main (#7) 1 éve
parallel_state.py 4be81db3d4 chore: adapt the megatron code for parallelism 1 éve

README.md

The files here are from the NVIDIA Megatron-LM repository, but only with inference-related code.