AlpinDale cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 주 전
..
__init__.py 8b42b58228 vlm: stack multimodal tensors to represent multiple images within each prompt (#937) 1 개월 전
audio.py 3693028340 feat: support for Audio modality (#698) 5 달 전
base.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 주 전
image.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 주 전
registry.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 주 전
utils.py be59e30139 vlm: add support for video modality + llava next video (#1014) 1 개월 전
video.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 주 전