AlpinDale 2aabf8fcf7 vlm: fix errors on ragged NestedTensors (#953) hace 2 semanas
..
__init__.py 8b42b58228 vlm: stack multimodal tensors to represent multiple images within each prompt (#937) hace 2 semanas
audio.py 3693028340 feat: support for Audio modality (#698) hace 4 meses
base.py 2aabf8fcf7 vlm: fix errors on ragged NestedTensors (#953) hace 2 semanas
image.py 9f3e7c86e2 feat: add fused Marlin MoE kernel (#934) hace 2 semanas
registry.py 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) hace 3 meses
utils.py 03bd85c950 chore: multi-image support for llava-next (#935) hace 2 semanas