initial commit
This commit is contained in:
@@ -9,12 +9,14 @@ DS Task Tag Scan is an AI-powered clothing tag identification and similarity sea
|
||||
* **Tag Identification**: Uses computer vision to identify clothing tag brands from images
|
||||
* **Text-Based Matching**: Implements TF-IDF and cosine similarity for tag name matching
|
||||
* **Image Similarity Search**: Uses CLIP embeddings to find visually similar tag images
|
||||
* **LLM Enhancement**: Optional LLM analysis for improved similarity filtering
|
||||
* **Metadata Extraction**: Provides appraisal values, years, and status information for similar tags
|
||||
|
||||
## Tech Stack
|
||||
|
||||
* **Computer Vision**: CLIP or ViT models (free from Hugging Face)
|
||||
* **Text Processing**: TF-IDF vectorization and cosine similarity
|
||||
* **LLM Enhancement**: Groq LLaVA (optional)
|
||||
* **Backend**: Flask
|
||||
* **Image Processing**: Pillow, OpenCV
|
||||
* **Data Processing**: Pandas, NumPy, scikit-learn
|
||||
@@ -128,13 +130,15 @@ def identify_tag(image_url):
|
||||
1. **Tag Identification**: Implement vision-based tag recognition using free models
|
||||
2. **Text Matching**: Use TF-IDF and cosine similarity for tag matching
|
||||
3. **Image Similarity**: Implement CLIP-based image embedding and search
|
||||
4. **Data Processing**: Handle image downloads and metadata extraction
|
||||
5. **API Design**: Create clean Flask endpoints with proper error handling
|
||||
4. **LLM Enhancement**: Add optional LLM analysis for better results
|
||||
5. **Data Processing**: Handle image downloads and metadata extraction
|
||||
6. **API Design**: Create clean Flask endpoints with proper error handling
|
||||
|
||||
## Data Sources
|
||||
|
||||
* `tag_guides_clean.json`: Contains tag information with historical images and year ranges
|
||||
* `expert_data.csv`: Contains tag images with appraisal values, status, and metadata
|
||||
* `community_data.csv`: Contains tag images with appraisal values, status, and metadata
|
||||
|
||||
## Vision Model Options
|
||||
|
||||
@@ -144,4 +148,13 @@ You can use any of these free models from Hugging Face:
|
||||
- EasyOCR (text extraction)
|
||||
- ResNet (image classification)
|
||||
|
||||
All models are available for free to use
|
||||
## What Success Looks Like
|
||||
- ✅ Can extract text from tag images
|
||||
- ✅ Can match brands using TF-IDF similarity
|
||||
- ✅ Can find visually similar images
|
||||
- ✅ Optional: LLM enhances similarity analysis
|
||||
- ✅ Returns properly formatted JSON response
|
||||
- ✅ Response time under 60 seconds
|
||||
|
||||
|
||||
All models are available for free to use. Good Luck !
|
||||
Reference in New Issue
Block a user