No module named 'torchvision.transforms.functional_tensor'
Even after upgrading and updating the library, I still get this error:
No module named 'torchvision.transforms.functional_tensor'
In pytorchvideo, changing import torchvision.transforms.functional_tensor as F_t to import torchvision.transforms.functional as F_t works fine for me.
I think this is caused by a change in the torchvision library.
Command to apply the above patch in Colab. If you are not on Colab, change the path to your Python location.
!sed -i '9s/.*/import torchvision.transforms.functional as F_t/' /usr/local/lib/python3.11/dist-packages/pytorchvideo/transforms/augmentations.py
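If you would rather not hardcode the site-packages path or the line number, the same patch can be sketched in Python. This is only a rough sketch: patch_file and find_augmentations_file are hypothetical helper names, and it assumes the installed pytorchvideo still has the old import in pytorchvideo/transforms/augmentations.py, as in the sed command above.

```python
import importlib.util
from pathlib import Path

# The import that fails on newer torchvision, and the replacement suggested in this thread.
OLD_IMPORT = "import torchvision.transforms.functional_tensor as F_t"
NEW_IMPORT = "import torchvision.transforms.functional as F_t"


def patch_file(path):
    """Replace the old import in the file at `path`; return True if the file changed."""
    path = Path(path)
    source = path.read_text()
    if OLD_IMPORT not in source:
        return False  # already patched, or a different pytorchvideo layout
    path.write_text(source.replace(OLD_IMPORT, NEW_IMPORT))
    return True


def find_augmentations_file():
    """Locate augmentations.py without importing the broken submodule.

    Only the top-level package is looked up, so nothing from
    pytorchvideo.transforms is executed here.
    """
    spec = importlib.util.find_spec("pytorchvideo")
    if spec is None or not spec.submodule_search_locations:
        return None
    package_dir = Path(list(spec.submodule_search_locations)[0])
    return package_dir / "transforms" / "augmentations.py"


if __name__ == "__main__":
    target = find_augmentations_file()
    if target is None or not target.exists():
        print("pytorchvideo not found in this environment")
    else:
        status = "patched" if patch_file(target) else "nothing to change"
        print(f"{target}: {status}")
```

Running it a second time should report that there is nothing to change. Restart the runtime afterwards if pytorchvideo was already imported in the session.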
After applying what @lly0571 suggested, the original error is gone, but I ran into a new one:
Error processing video A_turtle_is_walking_on_the_sand.MP4: 'CogVLMVideoForCausalLM' object has no attribute '_extract_past_from_model_output'