AlpinDale cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 weeks ago
__init__.py 8b42b58228 vlm: stack multimodal tensors to represent multiple images within each prompt (#937) 1 month ago
audio.py 3693028340 feat: support for Audio modality (#698) 5 months ago
base.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 weeks ago
image.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 weeks ago
registry.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 weeks ago
utils.py be59e30139 vlm: add support for video modality + llava next video (#1014) 1 month ago
video.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 weeks ago