Commit Graph

11 Commits

Author SHA1 Message Date
rafaelsideguide c1f98d0371 fixed developer.notion special case 2024-10-11 10:54:59 -03:00
rafaelsideguide 6208ecdbc0 added logger 2024-07-23 17:30:46 -03:00
rafaelsideguide 0175152577 Fixed PDF match custom scraping 2024-07-02 11:25:17 -03:00
    Now it's working for both `https://getgc.ai/privacy` and `https://prairie.cards/products/wood-designs` usecases.
rafaelsideguide 5f69fc7677 Fixed the regex test 2024-06-25 18:24:01 -03:00
rafaelsideguide e37d151404 added parsePDF option to pageOptions 2024-06-12 15:06:47 -03:00
    user can decide if they are going to let us take care of the parse or they are going to parse the pdf by themselves
Nicolas 7cb14edec8 Nick: 2024-06-05 10:13:52 -07:00
Rafael Miller 9e000ded03 Merge branch 'main' into feat/better-gdrive-pdf-fetch 2024-06-05 14:07:56 -03:00
rafaelsideguide ccc55127d6 Added scroll xpaths on fire-engine for handling readme docs 2024-06-05 11:48:41 -03:00
rafaelsideguide b5045d1661 [feat] improved the scrape for gdrive pdfs 2024-06-04 17:47:28 -03:00
Nicolas 96257b7b17 Update handleCustomScraping.ts 2024-06-04 12:22:46 -07:00
Nicolas 674500affa Nick: 2024-06-04 12:15:39 -07:00