commit 778782e2058bf8b9cd3d55efdf4ba2a871156df6 Author: kowshik Date: Wed Mar 19 04:53:54 2025 +0600 First Commit diff --git a/README.md b/README.md new file mode 100644 index 0000000..a3094cc --- /dev/null +++ b/README.md @@ -0,0 +1,64 @@ +# Mini SpecsComply Pro (SCP) + +## Overview + +Mini SpecsComply Pro (SCP) is a lightweight document compliance and validation tool designed to analyze and verify technical documents against predefined standards and project-specific requirements. It leverages advanced AI models for embedding, reasoning, and ranking to ensure fast and accurate document processing. + +## Key Features + +- **Document Embedding:** Uses Cohere Embedding Model to generate vector representations for efficient comparison. +- **Fast LLM Processing:** GROQ LLM provides rapid document parsing and analysis. +- **Advanced Reasoning:** Claude 3.5 Sonnet is used for deep reasoning and compliance verification. +- **Enhanced Ranking:** Cohere Reranker ensures the most relevant compliance issues are prioritized. +- **Structured Compliance Feedback:** Generates summaries and detailed issue breakdowns for document corrections. +- **Efficient Resubmission Workflow:** Allows users to revise and resubmit documents based on feedback. + +## Tech Stack + +- **Backend:** Python (FastAPI or Flask for API development) +- **Vector Database:** Pinecone or Weaviate for document embeddings storage and retrieval +- **LLMs:** + + - GROQ for quick responses + - Claude 3.5 Sonnet for reasoning +- **Embedding & Reranking:** + + - Cohere Embedding Model + - Cohere Reranker + +## Workflow + +1. **Document Upload** + + - User uploads a document for compliance verification. + - Document is converted into embeddings using the Cohere Embedding Model. + - Stored in the vector database for efficient retrieval. +2. **Processing & Analysis** + + - GROQ LLM parses the document and extracts key sections. + - Claude 3.5 Sonnet performs reasoning to check compliance against standards. + - Cohere Reranker prioritizes the most critical compliance issues. +3. **Compliance Report Generation** + + - A structured report is generated, including: + - **Summary of Findings** + - **Detailed Compliance Issues** + - **Recommended Fixes** +4. **Feedback & Resubmission** + + - User receives feedback and revises the document. + - Resubmitted documents undergo the same pipeline for re-evaluation. + +## API Endpoints (Example) + +```yaml +POST /upload-document + - Uploads a document for analysis + - Returns document ID for tracking + +GET /document/{doc_id}/analysis + - Retrieves the compliance report for a document + +POST /document/{doc_id}/resubmit + - Allows resubmission of a revised document +``` diff --git a/data/1.Invitation to Tender.docx b/data/1.Invitation to Tender.docx new file mode 100644 index 0000000..41346e8 Binary files /dev/null and b/data/1.Invitation to Tender.docx differ diff --git a/data/2.Tender Specifications.docx b/data/2.Tender Specifications.docx new file mode 100644 index 0000000..2c277a4 Binary files /dev/null and b/data/2.Tender Specifications.docx differ diff --git a/data/3.Bill of Quantities.docx b/data/3.Bill of Quantities.docx new file mode 100644 index 0000000..b34a5aa Binary files /dev/null and b/data/3.Bill of Quantities.docx differ diff --git a/data/4.Scope of Work.docx b/data/4.Scope of Work.docx new file mode 100644 index 0000000..4a2b96d Binary files /dev/null and b/data/4.Scope of Work.docx differ diff --git a/data/7.Supplier SQualification requirements.docx b/data/7.Supplier SQualification requirements.docx new file mode 100644 index 0000000..7e65562 Binary files /dev/null and b/data/7.Supplier SQualification requirements.docx differ diff --git a/data/8.form of tender.docx b/data/8.form of tender.docx new file mode 100644 index 0000000..5474e82 Binary files /dev/null and b/data/8.form of tender.docx differ diff --git a/data/9.confidentiality agreement.docx b/data/9.confidentiality agreement.docx new file mode 100644 index 0000000..42d49a6 Binary files /dev/null and b/data/9.confidentiality agreement.docx differ diff --git a/data/Project1-FEED CONTRACTOR-MUL-E000-PR-LST-0001_Rev002 - Equipment List (1).docx b/data/Project1-FEED CONTRACTOR-MUL-E000-PR-LST-0001_Rev002 - Equipment List (1).docx new file mode 100644 index 0000000..8ae002c Binary files /dev/null and b/data/Project1-FEED CONTRACTOR-MUL-E000-PR-LST-0001_Rev002 - Equipment List (1).docx differ