AlpinDale cc5e185795 VLM: support passing multimodal processor kwargs (#1102) vor 2 Wochen
..
__init__.py 8b42b58228 vlm: stack multimodal tensors to represent multiple images within each prompt (#937) vor 1 Monat
audio.py 3693028340 feat: support for Audio modality (#698) vor 5 Monaten
base.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) vor 2 Wochen
image.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) vor 2 Wochen
registry.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) vor 2 Wochen
utils.py be59e30139 vlm: add support for video modality + llava next video (#1014) vor 1 Monat
video.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) vor 2 Wochen