Gergő Móricz
b415e625a0
feat(scrape): get job result from GCS, avoid Redis (#1461)
* feat(scrape): get job result from GCS, avoid Redis
* call logjob on scrapes
* Fix inverse bool
* fix more
* migrate gracefully
* refactor
* feat(tests/search): test with scrape
2025-04-15 00:07:44 +02:00
Gergő Móricz
f18a6b20ff
extract concurrency hotfix
2025-04-11 20:38:54 +02:00
Gergő Móricz
6a10f0689d
ACUC: Dynamic Limits (FIR-1641) (#1434)
* extend acuc definition
* kill plan
* stuff
* stupid tests
* feat: better acuc
* feat(acuc): mock ACUC when not using db auth
2025-04-10 18:49:23 +02:00
Gergő Móricz
24f5199359
compare format (FIR-1560) (#1405)
2025-04-02 19:52:43 +02:00
Nicolas
04c6f511b5
(feat/extract) Add sources to the extraction (#1101)
* Nick: good state
* Nick: source tracker class
* Nick: show sources under flag
2025-01-28 13:46:21 -03:00
rafaelmmiller
c1a2981d59
default onlyMainContent=false for extract
2025-01-27 14:31:16 -03:00
Móricz Gergő
d3518e85a8
feat(extract): add logging
2025-01-23 12:05:15 +01:00
Nicolas
5030fea634
Update document-scraper.ts
2025-01-20 13:28:59 -03:00
Nicolas
6b2e1cbb28
Nick: cache /extract scrapes
2025-01-03 21:19:40 -03:00
Nicolas
233f347f5e
Nick: refactor
2024-12-26 12:41:37 -03:00