Commit Graph

74 Commits

Author SHA1 Message Date
rafaelsideguide dc189e1e9d feat: webhooks config on v1 2024-08-30 16:22:59 -03:00
Nicolas d872bf0c4c Merge branch 'main' into v1-webscraper 2024-08-28 12:42:23 -03:00
Nicolas 1e08e6d317 Merge branch 'main' into nsc/job-priority 2024-08-27 15:01:58 -03:00
Gergo Moricz 96e91ab9ec convert webhook call to v1 2024-08-25 14:05:46 +02:00
Gergő Móricz e7f267b6fe Merge branch 'main' into v1-webscraper 2024-08-23 17:21:54 +02:00
rafaelsideguide 7473b74021 fix: html and rawlhtmls for pdfs 2024-08-22 15:15:45 -03:00
Gergő Móricz 6d48dbcd38 feat(sentry): add trace continuity for queue 2024-08-22 16:47:38 +02:00
Nicolas 477c3257dc Nick: 2024-08-21 22:53:33 -03:00
Nicolas c7bfe4ffe8 Nick: 2024-08-21 22:20:40 -03:00
Nicolas 0ea0a5db46 Nick: wip 2024-08-21 20:54:39 -03:00
Gergő Móricz 85ff0c311e Add worker ID to job attribute 2024-08-21 19:21:29 +02:00
Gergő Móricz 920702cdde Update builder to handle uploading sourcemaps 2024-08-21 19:08:03 +02:00
Gergő Móricz 86942728e3 Add metadata for queue-worker and Express 2024-08-21 17:58:27 +02:00
Nicolas 35decb1af2 Nick: 2024-08-21 12:35:03 -03:00
Gergő Móricz 70d50b3640 fix(queue-worker): move dotenv config up 2024-08-20 19:25:19 +02:00
Nicolas 5abd26a267 Nick: set the crawl limit to the remaining credits 2024-08-20 14:16:54 -03:00
rafaelsideguide 4ffc60596a Update queue-worker.ts 2024-08-19 09:29:23 -03:00
Nicolas 3fe82b4f12 Update queue-worker.ts 2024-08-17 03:09:31 -04:00
Nicolas f797380112 Nick: 2024-08-16 22:17:38 -04:00
Nicolas 47123be783 Nick: weird activity block 2024-08-16 22:01:56 -04:00
Gergő Móricz 803577eeba feat(crawl): webhook 2024-08-16 23:42:48 +02:00
Gergő Móricz aabfaf0ac5 clean up crawl-status, fix db ddos 2024-08-16 23:29:39 +02:00
Gergő Móricz fd6432e7fd fix(queue-worker): correct job success 2024-08-16 19:16:08 +02:00
Gergő Móricz 6e54942265 fix(queue-worker): add cancelled to crawl log 2024-08-16 19:11:53 +02:00
Gergő Móricz d0a8382a5b fix(queue-worker): crawl finishing race condition 2024-08-16 18:48:52 +02:00
Gergő Móricz 6bd52e63bf fix(queue-worker): fix linksOnPage undefined error 2024-08-16 18:42:24 +02:00
Gergő Móricz 5a6570cba2 fix(webhooks): call back with parent crawl ID 2024-08-16 17:42:42 +02:00
Gergő Móricz 965a5817d1 fix(queue-worker): log jobs correctly 2024-08-15 19:27:15 +02:00
Gergő Móricz 846610681b fix: fix posthog, add dummy crawl DB items 2024-08-15 18:55:18 +02:00
Gergő Móricz 8a5cad72f6 fix(queue-worker): variable name collision 2024-08-14 22:02:05 +02:00
Gergő Móricz a6c81f9d62 fix: return all data when calling webhook 2024-08-14 17:53:47 +02:00
Gergo Moricz 2e5e480cc2 fix(crawl): call webhooks 2024-08-13 22:10:17 +02:00
Gergo Moricz 8dbac0268c feat: offload crawl results to the DB 2024-08-13 21:40:59 +02:00
Gergo Moricz 2413e33359 fix(queue-worker): remove console.log 2024-08-13 21:07:36 +02:00
Gergo Moricz d7549d4dc5 feat: remove webScraperQueue 2024-08-13 21:03:24 +02:00
Gergo Moricz 86e136beca feat: crawl to scrape conversion 2024-08-13 20:51:43 +02:00
Nicolas e28c415cf4 Nick: 2024-08-09 14:07:46 -04:00
Gergo Moricz 920b7f2f44 fix(runWebScraper): don't filter empty docs 2024-08-07 21:00:22 +02:00
Gergo Moricz 55ec96c23f fix(queue-worker): bad job lock extension time 2024-08-07 20:24:16 +02:00
Gergo Moricz ab7a35c581 fix(queue-worker): log lock extensions 2024-08-07 19:49:48 +02:00
Gergo Moricz a1c2ee5aa9 fix: always complete job, no try 2024-08-07 19:39:09 +02:00
Gergo Moricz 191dfbd9ca fix: move to completed in one place 2024-08-07 18:49:58 +02:00
Gergő Móricz 5fc7fcb77c Merge branch 'main' into feat/queue-scrapes 2024-08-07 16:35:44 +02:00
Gergo Moricz fe9fdb578b revert bad hotfixes 2024-08-07 16:34:25 +02:00
Gergo Moricz cdf7bad5b4 fix(runWebScraper): don't move to completed 2024-08-07 15:20:56 +02:00
Gergo Moricz 9df8719efa fix(queue-worker): raise queue log level to info 2024-08-07 14:56:04 +02:00
Gergo Moricz 7bb922071c fix(queue-worker): manually renew lock (testing) 2024-08-07 14:35:20 +02:00
Gergo Moricz 7c5cda7b45 fix(queue-worker): concurrency 2024-08-06 16:57:00 +02:00
Gergo Moricz 03c84a9372 cleanup and fix cancelling 2024-08-06 16:26:46 +02:00
Nicolas 52198f2991 Nick: 2024-07-30 16:15:08 -04:00