AlpinDale f644e10449 vlm: enable multimodal inputs for the LLM class (#992) hai 2 meses
..
__init__.py 8b42b58228 vlm: stack multimodal tensors to represent multiple images within each prompt (#937) hai 2 meses
audio.py 3693028340 feat: support for Audio modality (#698) hai 6 meses
base.py 2aabf8fcf7 vlm: fix errors on ragged NestedTensors (#953) hai 2 meses
image.py 9f3e7c86e2 feat: add fused Marlin MoE kernel (#934) hai 2 meses
registry.py 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) hai 5 meses
utils.py f644e10449 vlm: enable multimodal inputs for the LLM class (#992) hai 2 meses