AlpinDale f644e10449 vlm: enable multimodal inputs for the LLM class (#992) há 2 meses atrás
..
__init__.py 8b42b58228 vlm: stack multimodal tensors to represent multiple images within each prompt (#937) há 2 meses atrás
audio.py 3693028340 feat: support for Audio modality (#698) há 6 meses atrás
base.py 2aabf8fcf7 vlm: fix errors on ragged NestedTensors (#953) há 2 meses atrás
image.py 9f3e7c86e2 feat: add fused Marlin MoE kernel (#934) há 2 meses atrás
registry.py 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) há 5 meses atrás
utils.py f644e10449 vlm: enable multimodal inputs for the LLM class (#992) há 2 meses atrás