.. |
__init__.py
|
8b42b58228
vlm: stack multimodal tensors to represent multiple images within each prompt (#937)
|
2 hafta önce |
audio.py
|
3693028340
feat: support for Audio modality (#698)
|
4 ay önce |
base.py
|
2aabf8fcf7
vlm: fix errors on ragged NestedTensors (#953)
|
2 hafta önce |
image.py
|
9f3e7c86e2
feat: add fused Marlin MoE kernel (#934)
|
2 hafta önce |
registry.py
|
89a2c6dee1
chore: refactor `MultiModalConfig` initialization and profiling (#745)
|
3 ay önce |
utils.py
|
03bd85c950
chore: multi-image support for llava-next (#935)
|
2 hafta önce |