Commit Graph

10 Commits

Author SHA1 Message Date
Gergő Móricz b415e625a0 feat(scrape): get job result from GCS, avoid Redis (#1461)
* feat(scrape): get job result from GCS, avoid Redis

* call logjob on scrapes

* Fix inverse bool

* fix more

* migrate gracefully

* refactor

* feat(tests/search): test with scrape
2025-04-15 00:07:44 +02:00
Gergő Móricz f18a6b20ff extract concurrency hotfix 2025-04-11 20:38:54 +02:00
Gergő Móricz 6a10f0689d ACUC: Dynamic Limits (FIR-1641) (#1434)
* extend acuc definition

* kill plan

* stuff

* stupid tests

* feat: better acuc

* feat(acuc): mock ACUC when not using db auth
2025-04-10 18:49:23 +02:00
Gergő Móricz 24f5199359 compare format (FIR-1560) (#1405) 2025-04-02 19:52:43 +02:00
Nicolas 04c6f511b5 (feat/extract) Add sources to the extraction (#1101)
* Nick: good state

* Nick: source tracker class

* Nick: show sources under flag
2025-01-28 13:46:21 -03:00
rafaelmmiller c1a2981d59 default onlyMainContent=false for extract 2025-01-27 14:31:16 -03:00
Móricz Gergő d3518e85a8 feat(extract): add logging 2025-01-23 12:05:15 +01:00
Nicolas 5030fea634 Update document-scraper.ts 2025-01-20 13:28:59 -03:00
Nicolas 6b2e1cbb28 Nick: cache /extract scrapes 2025-01-03 21:19:40 -03:00
Nicolas 233f347f5e Nick: refactor 2024-12-26 12:41:37 -03:00