Add Vision Transformer (ViT) Demo to computer_vision Module

### Feature description

This PR adds a fully functional demo of a Vision Transformer (ViT) for image classification using Hugging Face Transformers.

Features included:
- Loads a sample image from the web.
- Uses ViTImageProcessor for preprocessing.
- Performs inference with ViTForImageClassification.
- Prints the predicted label for the image.
- Handles network timeout and correct import order.

Example output:
Predicted label: tabby, tabby cat

This demo can be used as a reference for anyone learning ViT or image classification with Hugging Face Transformers.

Additional Notes:
- Requires torch, transformers, and PIL.
- Can be extended to classify local images easily.

Hacktoberfest 2025: ✅


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add Vision Transformer (ViT) Demo to computer_vision Module #13372

Feature description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Add Vision Transformer (ViT) Demo to computer_vision Module #13372

Description

Feature description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions