initial commit

2025-07-05 22:43:53 +00:00
parent 0a2488b798
commit 380d0c08a1
+16 -3
@@ -9,12 +9,14 @@ DS Task Tag Scan is an AI-powered clothing tag identification and similarity sea
* **Tag Identification**: Uses computer vision to identify clothing tag brands from images
* **Text-Based Matching**: Implements TF-IDF and cosine similarity for tag name matching
* **Image Similarity Search**: Uses CLIP embeddings to find visually similar tag images
* **LLM Enhancement**: Optional LLM analysis for improved similarity filtering
* **Metadata Extraction**: Provides appraisal values, years, and status information for similar tags
## Tech Stack
* **Computer Vision**: CLIP or ViT models (free from Hugging Face)
* **Text Processing**: TF-IDF vectorization and cosine similarity
* **LLM Enhancement**: Groq LLaVA (optional)
* **Backend**: Flask
* **Image Processing**: Pillow, OpenCV
* **Data Processing**: Pandas, NumPy, scikit-learn
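
The text-matching feature above (TF-IDF plus cosine similarity) can be sketched in a few lines. This is a minimal, dependency-free illustration of the idea, not the project's implementation. The function names and the sample tag names are hypothetical, and in practice scikit-learn's `TfidfVectorizer` would likely replace the hand-rolled weighting.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace; real tag text may need more cleanup."""
    return text.lower().split()


def build_idf(documents):
    """Smoothed inverse document frequency for every token in the corpus."""
    n_docs = len(documents)
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(tokenize(doc)))
    return {
        tok: math.log((1 + n_docs) / (1 + df)) + 1.0
        for tok, df in doc_freq.items()
    }


def tfidf_vector(text, idf):
    """Sparse TF-IDF vector (token -> weight); tokens unseen in the corpus are ignored."""
    counts = Counter(tok for tok in tokenize(text) if tok in idf)
    return {tok: tf * idf[tok] for tok, tf in counts.items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(tok, 0.0) for tok, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_match(query, tag_names):
    """Return (name, score) for the known tag name most similar to the query text."""
    idf = build_idf(tag_names)
    query_vec = tfidf_vector(query, idf)
    scored = [(name, cosine(query_vec, tfidf_vector(name, idf))) for name in tag_names]
    return max(scored, key=lambda pair: pair[1])


# Hypothetical sample data, for illustration only.
known_tags = ["Levi's Red Tab", "Nike Swoosh", "Adidas Trefoil"]
name, score = best_match("nike swoosh tag", known_tags)
```

A score of 0.0 means the query shares no tokens with any known tag name, so a caller would probably want a minimum-score threshold before accepting a match.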
@@ -128,13 +130,15 @@ def identify_tag(image_url):
1. **Tag Identification**: Implement vision-based tag recognition using free models
2. **Text Matching**: Use TF-IDF and cosine similarity for tag matching
3. **Image Similarity**: Implement CLIP-based image embedding and search
4. **Data Processing**: Handle image downloads and metadata extraction
5. **API Design**: Create clean Flask endpoints with proper error handling
4. **LLM Enhancement**: Add optional LLM analysis for better results
5. **Data Processing**: Handle image downloads and metadata extraction
6. **API Design**: Create clean Flask endpoints with proper error handling
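
For step 3, the search itself reduces to ranking stored embeddings by cosine similarity to the query embedding. The sketch below assumes the embeddings have already been computed (for example by a CLIP model) and are plain lists of floats; `top_k_similar`, the index layout, and the image ids are hypothetical names chosen for illustration.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def top_k_similar(query_embedding, index, k=3):
    """Rank (image_id, embedding) pairs by cosine similarity to the query.

    Returns up to k (image_id, score) pairs, highest score first.
    """
    q = normalize(query_embedding)
    scored = []
    for image_id, embedding in index:
        e = normalize(embedding)
        scored.append((image_id, sum(a * b for a, b in zip(q, e))))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy 3-dimensional vectors stand in for real image embeddings.
index = [("img_a", [1.0, 0.0, 0.0]), ("img_b", [0.0, 1.0, 0.0]), ("img_c", [1.0, 1.0, 0.0])]
results = top_k_similar([1.0, 0.1, 0.0], index, k=2)
```

A linear scan like this is fine for a few thousand images; a larger collection would likely call for a vectorized NumPy version or an approximate nearest-neighbour index.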
## Data Sources
* `tag_guides_clean.json`: Contains tag information with historical images and year ranges
* `expert_data.csv`: Contains tag images with appraisal values, status, and metadata
* `community_data.csv`: Contains tag images with appraisal values, status, and metadata
## Vision Model Options
@@ -144,4 +148,13 @@ You can use any of these free models from Hugging Face:
- EasyOCR (text extraction)
- ResNet (image classification)
All models are available for free to use
## What Success Looks Like
- ✅ Can extract text from tag images
- ✅ Can match brands using TF-IDF similarity
- ✅ Can find visually similar images
- ✅ Optional: LLM enhances similarity analysis
- ✅ Returns properly formatted JSON response
- ✅ Response time under 60 seconds
All models are free to use. Good luck!
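
The last two success criteria (a properly formatted JSON response, returned in under 60 seconds) can be checked with a small helper like the one below. This is only a sketch: the README does not specify the response schema, so every key name here is an assumption, as is the `build_response` function itself.

```python
import json
import time

TIME_BUDGET_SECONDS = 60  # taken from the success criteria above


def build_response(brand, similar_tags, started_at, now=None):
    """Serialize a result to JSON and record whether it met the time budget.

    The field names are hypothetical; the source does not define a schema.
    `started_at` and `now` are time.monotonic() readings in seconds.
    """
    now = time.monotonic() if now is None else now
    elapsed = now - started_at
    payload = {
        "brand": brand,
        "similar_tags": similar_tags,
        "elapsed_seconds": round(elapsed, 3),
        "within_time_budget": elapsed < TIME_BUDGET_SECONDS,
    }
    return json.dumps(payload)


started = time.monotonic()
# ... identification and similarity search would run here ...
body = build_response(
    "ExampleBrand",
    [{"image_id": "img_a", "score": 0.93}],
    started_at=started,
)
```

In a Flask endpoint the same payload dictionary could be returned through `jsonify` instead of `json.dumps`.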