AlpinDale cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 1 週間 前
..
__init__.py 8b42b58228 vlm: stack multimodal tensors to represent multiple images within each prompt (#937) 1 ヶ月 前
audio.py 3693028340 feat: support for Audio modality (#698) 4 ヶ月 前
base.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 1 週間 前
image.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 1 週間 前
registry.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 1 週間 前
utils.py be59e30139 vlm: add support for video modality + llava next video (#1014) 1 ヶ月 前
video.py cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 1 週間 前