Anton_wireframe

Author	SHA1	Message	Date
bolade	84e3c7b72a	feat: Implement database ingestion for investors and companies - Added main ingestion logic in main.py to process CSV files for investors and companies. - Implemented data cleaning functions for names, strings, integers, and websites. - Established relationships between investors, companies, and sectors using SQLAlchemy ORM. - Created models for investors, companies, sectors, and their relationships in models.py. - Set up logging for error tracking during data processing. - Initialized database and created necessary tables.	2025-10-07 20:01:19 +01:00
bolade	c0fbbdd917	Implement manual JSON parsing for company profiles; enhance data extraction and processing efficiency; add comprehensive test script for validation	2025-10-07 12:07:43 +01:00
bolade	1f3f08e80d	Remove deprecated stage_focus column and update database path for consistency; add schema verification script and document schema mismatch fixes	2025-10-07 11:31:16 +01:00
bolade	cd7172ed9f	Add test script for manual JSON parser with LLM currency conversion - Implemented a new test script `test_parser.py` to validate the functionality of the manual JSON parser. - The script loads investor data from a CSV file and processes a sample of three investors. - Results include detailed information about each investor, their funds, team members, and investment thesis. - Added error handling for missing API key in the environment variables.	2025-10-06 14:07:28 +01:00
bolade	17bc5acbc8	Refactor investor similarity search to utilize AI for improved query generation; adjust DataFrame parsing to skip initial rows for better data handling.	2025-09-29 15:58:09 +01:00
bolade	6d902345c0	Refactor investor and company schemas to allow optional fields; update filtering logic in read_companies function and add find_similar_investors endpoint; change LLM model in InvestorProcessor and QueryProcessor for improved performance.	2025-09-27 10:45:08 +01:00
bolade	d36367fbe9	Add project management functionality with CRUD operations and associations; introduce project schemas and update main application routing.	2025-09-27 08:53:59 +01:00
bolade	abac19c6ae	Update .gitignore to exclude __pycache__ directories and modify schemas to allow optional fields for better flexibility; adjust batch size in InvestorProcessor for improved processing efficiency.	2025-09-26 15:56:29 +01:00
bolade	f2bbcb96f3	Refactor database models and schemas to allow nullable fields; update init_database function for improved initialization.	2025-09-26 15:24:42 +01:00
bolade	0f7beca5e1	made version 2	2025-09-25 17:00:38 +01:00
bolade	ba0ed169ce	Implement investor processing and querying functionality - Added InvestorProcessor class for processing CSV data in batches and saving to SQL and vector databases. - Introduced QueryProcessor class for querying investor information from SQL and vector databases. - Integrated OpenAI's ChatGPT for structured output generation. - Implemented data cleaning and control character removal in CSV processing. - Added asynchronous processing capabilities for batch handling. - Established connection to ChromaDB for vector storage of investor descriptions. - Defined structured output schemas using Pydantic for investor data validation. - Enhanced settings management for API key and database configurations.	2025-08-29 18:42:55 +01:00

11 Commits
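
Commit 84e3c7b72a mentions data cleaning functions for names, strings, integers, and websites, but the log does not show them. The following is a minimal sketch of what such helpers could look like. The function names (`clean_string`, `clean_integer`, `clean_website`) and the exact cleaning rules are assumptions for illustration, not the code in main.py.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Hypothetical helpers: names and rules are illustrative, not taken from main.py.
_CONTROL_CHARS = re.compile(r"[\x00-\x1f\x7f]")


def clean_string(value: Optional[str]) -> Optional[str]:
    """Replace control characters, collapse whitespace, map empty text to None."""
    if value is None:
        return None
    text = _CONTROL_CHARS.sub(" ", str(value))
    text = " ".join(text.split())
    return text or None


def clean_integer(value) -> Optional[int]:
    """Parse values such as '1,200' or '35.0'; return None when not numeric."""
    text = clean_string(None if value is None else str(value))
    if text is None:
        return None
    try:
        return int(float(text.replace(",", "")))
    except (ValueError, OverflowError):
        return None


def clean_website(value: Optional[str]) -> Optional[str]:
    """Add a scheme when missing, lowercase the host, drop a trailing slash."""
    text = clean_string(value)
    if text is None:
        return None
    if "://" not in text:
        text = "https://" + text  # assumed default scheme
    parsed = urlparse(text)
    if not parsed.netloc:
        return None
    return f"{parsed.scheme}://{parsed.netloc.lower()}{parsed.path.rstrip('/')}"
```

Returning `None` for unusable values fits the nullable columns introduced in f2bbcb96f3, though whether the real ingestion code does this is not stated in the log.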