Autor | SHA1 Nachricht | Datum |
---|---|---|
|
e26a4ac698 chore: avoid loading the unused layers and init the VLM up to the required feature space | vor 5 Monaten |
|
c11a8bdaad fix: calculate max number of multi-modal tokens automatically | vor 5 Monaten |
|
4599c98f99 feat: dynamic image size support for VLMs | vor 5 Monaten |
|
3a0fdf7b9b chore: remove `image_input_type` from VLM config | vor 6 Monaten |
|
c0c336aaa3 refactor: registry for processing model inputs; quick_gelu; clip model support | vor 6 Monaten |