Commit Graph

1470 Commits

Author SHA1 Message Date
rafaelsideguide
48056ea1bd feat: added go html to md parser 2024-09-02 14:15:56 -03:00
rafaelsideguide
036eb925a2 fix(v1): maxDepth 2024-09-02 09:55:35 -03:00
Nicolas
86b0477708 Update map.ts
2024-09-01 19:54:15 -03:00
Nicolas
304b3e752e Update map.ts 2024-09-01 19:52:21 -03:00
Nicolas
cb2dfe29be Nick: 2024-09-01 15:06:36 -03:00
Gergő Móricz
980293652d fix(queue-worker): new getJobs, log on v0 2024-09-01 19:29:35 +02:00
Nicolas
44fe741c35 Update queue-worker.ts 2024-09-01 14:19:43 -03:00
Nicolas
758f729ae2 Update queue-worker.ts 2024-09-01 14:10:47 -03:00
Nicolas
5c05bb12a7 Update webhook.ts 2024-09-01 13:58:50 -03:00
Nicolas
ae90370543 Update webhook.ts 2024-09-01 13:58:07 -03:00
Nicolas
0df2441d7f Nick: sdks good 2024-09-01 13:54:41 -03:00
Nicolas
b68a50fea3 Nick: 2024-09-01 13:52:34 -03:00
Nicolas
979697df1c Update queue-worker.ts 2024-09-01 13:47:51 -03:00
Nicolas
95b9dc915d Nick: webhooks v1 working great 2024-09-01 13:44:36 -03:00
Nicolas
faae98ecb8 Merge branch 'main' into v1/webhooks 2024-09-01 12:56:11 -03:00
Nicolas
fe6abe8f7d Nick: 2024-09-01 12:55:02 -03:00
Nicolas
2ef43d5fc2 Update rate-limiter.ts
2024-08-31 14:26:16 -03:00
Nicolas
8431be5826 Nick: 2024-08-31 14:23:55 -03:00
Nicolas
af5cc5f16b Nick: 1.2 - v1 llm extract 2024-08-31 13:25:48 -03:00
Nicolas
0c59564393 Merge branch 'main' into v1/llm-extract 2024-08-30 21:31:49 -03:00
Gergő Móricz
693dc14d9b remove invalid keys
2024-08-31 00:08:24 +02:00
Nicolas
6d1da2e22a Update index.ts 2024-08-30 18:45:02 -03:00
Nicolas
d347160ff9 Nick: 2024-08-30 17:32:41 -03:00
Nicolas
522d256b11 Merge branch 'main' into v1/llm-extract 2024-08-30 17:16:21 -03:00
Nicolas
ef5c8931eb Update scrape.ts 2024-08-30 17:09:44 -03:00
Nicolas
050cac51ec Update types.ts 2024-08-30 17:02:29 -03:00
Nicolas
012bc74e14 Revert "Update types.ts"
This reverts commit 77766cf00d.
2024-08-30 17:02:05 -03:00
Nicolas
77766cf00d Update types.ts 2024-08-30 17:01:54 -03:00
rafaelsideguide
87e61f2d51 v0 working 2024-08-30 16:38:55 -03:00
rafaelsideguide
dc189e1e9d feat: webhooks config on v1 2024-08-30 16:22:59 -03:00
Nicolas
23dfe90baa Nick: 2024-08-30 15:43:31 -03:00
Nicolas
0732997a13 Merge branch 'main' into v1/llm-extract 2024-08-30 15:42:45 -03:00
Gergo Moricz
c8e0bb93ea fix(crawl-status): handle null data when purging rawHtml 2024-08-30 20:40:57 +02:00
Nicolas
4f92bb4bf4 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-08-30 15:29:47 -03:00
Nicolas
282962e36f Nick: 2024-08-30 15:29:41 -03:00
Gergo Moricz
6a6b487474 fix(v1): don't fail on doc = null 2024-08-30 20:28:18 +02:00
Nicolas
71dab56e36 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-08-30 15:21:29 -03:00
Nicolas
d7dbc2536d Update crawl.ts 2024-08-30 15:21:22 -03:00
Nicolas
9de39cfa2c Merge pull request #591 from mendableai/mog/websocket-sdk
[v1] Websockets SDKs
2024-08-30 15:06:06 -03:00
Nicolas
234c6daee8 Update supabase-jobs.ts 2024-08-30 14:52:59 -03:00
Nicolas
7c8ca0c889 Update types.ts 2024-08-30 13:54:04 -03:00
Nicolas
f4a49bc1d1 Update types.ts 2024-08-30 13:49:16 -03:00
Nicolas
a2881e9288 Nick: llm extract support on node sdk 2024-08-30 13:43:19 -03:00
Nicolas
45e33563eb Nick: python working 2024-08-30 12:58:38 -03:00
Nicolas
bb4808443c Merge branch 'main' into v1/llm-extract 2024-08-30 12:48:26 -03:00
Nicolas
2fd9c285bd Merge pull request #582 from mendableai/fix--rawHtml-return-on-crawl
[Bug] Moved delete rawHtml to end of controller
2024-08-30 12:40:06 -03:00
Nicolas
b8920d6f4a Nick: 2024-08-30 12:37:45 -03:00
Nicolas
552328d168 Merge branch 'main' into v1/llm-extract 2024-08-30 12:34:50 -03:00
Nicolas
ee3e5dc69c Nick: 2024-08-30 12:34:45 -03:00
Gergő Móricz
26d87bd099 fix(api): handle zoderrors earlier 2024-08-30 17:33:42 +02:00
Nicolas
1805d901a9 Update credit_billing.ts 2024-08-30 12:05:37 -03:00
Nicolas
e5ca4364ba Nick: improvements to llm extract error handling 2024-08-30 11:57:55 -03:00
Nicolas
52ac132328 Update auth.ts 2024-08-30 11:10:48 -03:00
Rafael Miller
636e39d366 Merge branch 'main' into mog/websocket-sdk 2024-08-30 11:09:09 -03:00
rafaelsideguide
ae38c26fa8 feat(v1-sdks): async crawl node, python websocket + async crawl + example 2024-08-30 10:09:39 -03:00
Eric Ciarla
7caaee28c5 Update ui component to v1 2024-08-29 21:40:44 -04:00
Nicolas
41eb620959 Nick: prompt option, still need to convert to new structured outputs 2024-08-29 21:00:57 -03:00
Nicolas
49e1cb7ca0 Nick: 2024-08-29 20:08:06 -03:00
Nicolas
63264644e1 Nick: fixed js-sdk map params
2024-08-29 20:06:55 -03:00
rafaelsideguide
377e8ded34 removed v0 support 2024-08-29 17:40:43 -03:00
rafaelsideguide
5f11275fe7 Merge remote-tracking branch 'origin/main' into mog/websocket-sdk 2024-08-29 17:14:59 -03:00
Nicolas
db85f1a7d5 Update index.ts 2024-08-29 17:11:16 -03:00
Rafael Miller
eec6d86802 Merge branch 'main' into f/rust-sdk 2024-08-29 16:20:03 -03:00
Gergő Móricz
53018a683f feat(js-sdk): add crawlUrlAndWatch 2024-08-29 20:01:16 +02:00
Gergő Móricz
d4001e4528 fix(api/websocket): fix auth and termination 2024-08-29 20:01:00 +02:00
Nicolas
1174bc3c82 Nick: blocked sw/ra
2024-08-29 13:55:54 -03:00
rafaelsideguide
7eefaffe48 fix: rawHtml for v0 2024-08-29 12:24:14 -03:00
rafaelsideguide
f8323f9b11 fix: moved delete rawHtml to end of controller 2024-08-29 10:02:05 -03:00
Nicolas
02410aca36 Update runWebScraper.ts
2024-08-28 22:39:50 -03:00
Nicolas
67229c6b3a Nick: fixed credits issue
2024-08-28 18:32:45 -03:00
rafaelsideguide
c3158b0f98 fix(v1): js-sdk fixed crawl type 2024-08-28 16:27:29 -03:00
Nicolas
70bff7f8fb Nick: 2024-08-28 15:59:20 -03:00
Nicolas
8be75accb8 Nick: 2024-08-28 15:40:30 -03:00
rafaelsideguide
c7b3365ffd fix(v1): update readme - v1.0.1 2024-08-28 15:15:29 -03:00
Nicolas
547da88ccd Update rate-limiter.test.ts 2024-08-28 14:53:02 -03:00
Nicolas
4edd9a34c7 Nick: totalCount -> total, completedCount -> completed 2024-08-28 14:46:53 -03:00
Nicolas
2a56f89f50 Update index.test.ts 2024-08-28 14:24:26 -03:00
Nicolas
59547d7129 Update index.test.ts 2024-08-28 14:22:26 -03:00
rafaelsideguide
b7f62f93c4 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-28 14:18:07 -03:00
rafaelsideguide
1c35534e2a fix(v1): check-status for preview 2024-08-28 14:18:05 -03:00
Nicolas
af67b14c9d Nick: 2024-08-28 14:17:59 -03:00
Nicolas
8fbff28009 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-28 14:09:20 -03:00
Nicolas
08387c06f8 Update auth.ts 2024-08-28 14:09:12 -03:00
rafaelsideguide
ef2d8d012b Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-28 14:07:31 -03:00
rafaelsideguide
5cbf0dcaf5 fix(v1): includeTags 2024-08-28 14:07:28 -03:00
Nicolas
7ba3f3bd1a Nick: 2024-08-28 13:54:39 -03:00
Nicolas
9a13034095 Nick: 2024-08-28 13:46:39 -03:00
Nicolas
ecd07be49e Nick: fixed issues 2024-08-28 13:17:22 -03:00
Nicolas
4003d37fbc Nick: 2024-08-28 12:46:59 -03:00
Nicolas
d872bf0c4c Merge branch 'main' into v1-webscraper 2024-08-28 12:42:23 -03:00
Nicolas
9a43c6cda1 Nick: 2024-08-27 20:02:50 -03:00
Nicolas
7dff5cdf49 Nick: fixed sdk types and map preview 2024-08-27 20:02:39 -03:00
Nicolas
38ed845b93 Nick: fixed v0 backwards compatibility node sdk types 2024-08-27 19:10:58 -03:00
Nicolas
e7d283c4ef Update job-priority.ts 2024-08-27 17:04:04 -03:00
Nicolas
ca34f1203b Nick: bucket limit increase 2024-08-27 17:03:46 -03:00
Nicolas
f0dfcd6a49 Update job-priority.ts 2024-08-27 16:58:28 -03:00
Nicolas
080240e040 Merge branch 'main' into nsc/job-priority 2024-08-27 16:18:02 -03:00
Nicolas
88239d7211 Merge branch 'main' into v1-webscraper 2024-08-27 16:16:40 -03:00
Nicolas
8c8d0602bb Update rate-limiter.test.ts 2024-08-27 15:38:01 -03:00
Nicolas
bd84e49a77 Update rate-limiter.ts 2024-08-27 15:37:36 -03:00
Nicolas
a0f9a81ee6 Nick: 2024-08-27 15:36:46 -03:00
Nicolas
c009013ff6 Nick: expire tests 2024-08-27 15:26:43 -03:00
rafaelsideguide
43f6c0a10f Update example.ts 2024-08-27 15:08:29 -03:00
Nicolas
8b53285a93 Update job-priority.ts 2024-08-27 15:06:13 -03:00
Nicolas
06b70a47e0 Update job-priority.ts 2024-08-27 15:04:10 -03:00
Nicolas
1e08e6d317 Merge branch 'main' into nsc/job-priority 2024-08-27 15:01:58 -03:00
Nicolas
861e2ebdf1 Nick: 2x rate limits
2024-08-27 12:08:50 -03:00
Nicolas
170a8ebfe5 Nick: 2024-08-27 11:58:42 -03:00
Nicolas
3850b000f0 Nick: removing credit notification for now 2024-08-27 11:14:04 -03:00
Nicolas
d30119707f Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-27 11:12:00 -03:00
Nicolas
463d38f77e Update map.ts 2024-08-27 11:11:59 -03:00
rafaelsideguide
1ef41b92a0 feat: cancel
v0 implementation + e2e test
2024-08-27 09:42:55 -03:00
Nicolas
fb553a020d Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 19:57:28 -03:00
Nicolas
6ab6ef9004 Update auth.ts 2024-08-26 19:57:27 -03:00
rafaelsideguide
adc3e4233d Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 19:22:05 -03:00
rafaelsideguide
65faa3e163 tests/feat: url validation 2024-08-26 19:22:03 -03:00
Nicolas
558acffb33 Nick: @rafaelsideguide isarray for includes/excludes 2024-08-26 19:07:14 -03:00
Nicolas
7d93eab0f8 Nick: 2024-08-26 18:48:00 -03:00
rafaelsideguide
72454de18d Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 18:21:54 -03:00
rafaelsideguide
04556ded40 tests: e2e for crawl and crawl status 2024-08-26 18:21:52 -03:00
Nicolas
8c37ea6d96 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 18:17:17 -03:00
Nicolas
f277a0e2bb Update package.json 2024-08-26 18:17:09 -03:00
rafaelsideguide
f2f6f78dcf fix(url validation): sub paths 2024-08-26 18:12:03 -03:00
Nicolas
0bbb8bb24e Nick: 2024-08-26 17:17:12 -03:00
Nicolas
98a770f38f Nick: rm wip 2024-08-26 17:16:44 -03:00
Nicolas
6f68678b5d Nick: 2024-08-26 17:13:00 -03:00
Nicolas
b0bd71a3a9 Merge branch 'main' into v1-webscraper 2024-08-26 16:58:00 -03:00
Nicolas
2d78c20d68 Nick: 2024-08-26 16:56:27 -03:00
Nicolas
fa7dc5b10b Update rate-limiter.ts
2024-08-26 16:33:34 -03:00
Nicolas
4d0acc9722 Merge branch 'main' into v1-webscraper 2024-08-26 16:22:05 -03:00
Nicolas
5606fe5870 Nick: 2024-08-26 16:05:11 -03:00
rafaelsideguide
1baba3ce0a fix(go-sdk): submodules 2024-08-26 11:11:34 -03:00
Gergo Moricz
d591e0f51c block corterix.com for performance issues
2024-08-25 20:06:12 +02:00
rafaelsideguide
6f9a2687ae fixed turndown bug 2024-08-25 15:04:32 -03:00
Gergo Moricz
96e91ab9ec convert webhook call to v1 2024-08-25 14:05:46 +02:00
Nicolas
1f99bfd3c8 Update queue.ts
2024-08-23 22:47:12 -03:00
Nicolas
b80277d4de Update queue.ts 2024-08-23 22:46:44 -03:00
Nicolas
d87b62fed9 Nick: 2024-08-23 22:33:17 -03:00
Nicolas
b9e06e27f4 Update queue.ts 2024-08-23 22:17:27 -03:00
Nicolas
8e78511ed4 Update queue.ts 2024-08-23 22:15:47 -03:00
Nicolas
28d7a637c2 Update queue.ts 2024-08-23 22:07:49 -03:00
Nicolas
173f4ee1bf Nick: chrome cdp main | simple autoscaler 2024-08-23 20:09:59 -03:00
Gergő Móricz
064ebfc54d fix websocket 2024-08-23 19:55:41 +02:00
Gergő Móricz
05c250d3b8 Merge branch 'main' into v1-webscraper 2024-08-23 19:38:57 +02:00
Gergő Móricz
2ab0dd2e15 fix(scrape): add further llm extraction catch 2024-08-23 19:20:17 +02:00
Gergő Móricz
1054a1397b Merge branch 'main' into v1-webscraper 2024-08-23 19:14:49 +02:00
Nicolas
3d53f4e213 Nick: unblocking pin 2024-08-23 13:56:05 -03:00
Gergő Móricz
5ef3926d2a fix(scrape,search): handle failed jobs 2024-08-23 18:47:56 +02:00
Gergő Móricz
866e71910c further fixes 2024-08-23 18:27:00 +02:00
Gergő Móricz
eea530e0ad feat(v1): update for sentry 2024-08-23 17:29:42 +02:00
Gergő Móricz
e7f267b6fe Merge branch 'main' into v1-webscraper 2024-08-23 17:21:54 +02:00
Gergő Móricz
52a05b8c6e rename "dragonfly" to "redis" 2024-08-23 17:05:59 +02:00
Gergő Móricz
64e9be0cd4 feat(redis): use bitnami image
2024-08-22 23:38:04 +02:00
Gergő Móricz
8d9ff90bcb feat(fire-engine): propagate sentry trace 2024-08-22 23:38:04 +02:00
rafaelsideguide
74ea820bc6 fix: url and check for metadata 2024-08-22 18:32:19 -03:00
Nicolas
1f0abacadf Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-08-22 18:30:54 -03:00
Nicolas
1f779e261a Update rate-limiter.ts 2024-08-22 18:30:45 -03:00
Gergő Móricz
8e3c2b2855 fix(crawler): verify URL 2024-08-22 23:30:19 +02:00
Gergő Móricz
e690a6fda7 fix: remove QueueEvents 2024-08-22 22:38:39 +02:00
Gergő Móricz
76c8e9f996 fix
2024-08-22 22:24:24 +02:00
Gergő Móricz
ad82175fb8 fix(scrape): poll 2024-08-22 22:12:02 +02:00
rafaelsideguide
5f60a55967 workflow and npm now running v1 tests 2024-08-22 15:28:49 -03:00
rafaelsideguide
30e809966f Merge remote-tracking branch 'origin/v1/python-sdk' into v1-webscraper 2024-08-22 15:18:05 -03:00
rafaelsideguide
a37681bdff fix: replace jest, removed map for v0 2024-08-22 15:16:46 -03:00
rafaelsideguide
7473b74021 fix: html and rawlhtmls for pdfs 2024-08-22 15:15:45 -03:00
Gergő Móricz
dd737f1235 feat(sentry): add queue instrumentation to 2024-08-22 19:17:51 +02:00
Nicolas
d2521612b4 Update .gitignore 2024-08-22 14:15:19 -03:00
Gergő Móricz
7265ab7c67 fix(search): filter docs properly 2024-08-22 18:46:56 +02:00
rafaelsideguide
b1d61d8557 Merge remote-tracking branch 'origin/v1-webscraper' into v1/python-sdk 2024-08-22 13:39:09 -03:00
rafaelsideguide
ab88a75c70 fixes sdks 2024-08-22 13:38:34 -03:00
Gergő Móricz
d036738da0 fix(bullmq): duplicate redis connection for QueueEvents 2024-08-22 18:04:09 +02:00
Gergő Móricz
6d48dbcd38 feat(sentry): add trace continuity for queue 2024-08-22 16:47:38 +02:00
Gergő Móricz
6d92b8524d feat(scrape): record job result in span 2024-08-22 16:00:13 +02:00
Gergő Móricz
5ca36fe9fc feat(api): add more captureExceptions 2024-08-22 15:49:16 +02:00
Gergő Móricz
0e8fd6ce70 fix(scrape): ensure extractionSchema is an object if llm-extraction is specified 2024-08-22 14:50:51 +02:00
Gergő Móricz
4bd2ff26d3 fix(llm-extract): pass stacktrace properly 2024-08-22 14:37:09 +02:00
Gergő Móricz
e4adbaa88e fix(llm-extract): handle llm-extract if scrape failed
2024-08-22 14:12:52 +02:00
Gergő Móricz
670d253a8c fix(auth): fix error reporting 2024-08-22 14:08:09 +02:00
Gergő Móricz
7d9f5bf8b1 fix(crawl): don't use sitemap if it's empty
Fixes FIRECRAWL-SCRAPER-JS-11
2024-08-22 13:41:33 +02:00
Gergő Móricz
1f580deefc fix(crawl): validate includes.excludes regexes 2024-08-22 13:29:11 +02:00
Gergő Móricz
fbbc3878f1 fix(crawler): make sure includes/excludes is an array 2024-08-22 13:18:26 +02:00
Gergő Móricz
508568f943 fix(search): handle scrape timeouts on search
Fixes FIRECRAWL-SCRAPER-JS-15
2024-08-22 13:10:58 +02:00
Gergő Móricz
14fa75cae6 fix(crawl): send error if url is not a string
Fixes FIRECRAWL-SCRAPER-JS-1E and FIRECRAWL-SCRAPER-JS-Z
2024-08-22 13:09:08 +02:00
Nicolas
8a778278a9 Merge branch 'main' into nsc/job-priority 2024-08-21 22:57:55 -03:00
Gergo Moricz
0cdf41587e feat(sentry): add error handles to try-catch blocks
2024-08-22 03:55:40 +02:00
Nicolas
53ca704620 Update index.ts 2024-08-21 22:55:39 -03:00
Nicolas
477c3257dc Nick: 2024-08-21 22:53:33 -03:00
Nicolas
c7bfe4ffe8 Nick: 2024-08-21 22:20:40 -03:00
Nicolas
6bdb1d045d Merge branch 'main' into nsc/job-priority 2024-08-21 21:52:05 -03:00
Nicolas
e78d2af1f0 Nick: 2024-08-21 21:51:54 -03:00
Nicolas
e64d3815ea Merge branch 'main' into nsc/job-priority 2024-08-21 20:54:57 -03:00
Nicolas
0ea0a5db46 Nick: wip 2024-08-21 20:54:39 -03:00
rafaelsideguide
a4686e3c8c fixing tests 2024-08-21 15:56:48 -03:00
rafaelsideguide
fe2e8c0b7a includehtml fix 2024-08-21 15:54:00 -03:00
Gergő Móricz
629da74a5c fix(sentry): decrease tracesSampleRate
2024-08-21 20:51:35 +02:00
Gergő Móricz
55009e51f5 fix: filter out invalid URLs from crawl links 2024-08-21 20:49:25 +02:00
Gergő Móricz
dae1408e66 fix(Dockerfile): retain sentry auth token properly 2024-08-21 20:40:42 +02:00
Gergő Móricz
ac9783ed2f fix(sentry): adjust profiles sample rate to be even lower 2024-08-21 20:21:16 +02:00
Gergő Móricz
9579f03c4b fix: import resolution 2024-08-21 20:16:06 +02:00
Gergő Móricz
6104d74213 fix(sentry): drop profiling sample rate 2024-08-21 20:12:47 +02:00
Gergő Móricz
3d5dc9d90a feat(sentry): add log + server name 2024-08-21 19:39:10 +02:00
Gergő Móricz
85ff0c311e Add worker ID to job attribute 2024-08-21 19:21:29 +02:00
Gergő Móricz
920702cdde Update builder to handle uploading sourcemaps 2024-08-21 19:08:03 +02:00
Gergő Móricz
86942728e3 Add metadata for queue-worker and Express 2024-08-21 17:58:27 +02:00
Nicolas
35decb1af2 Nick: 2024-08-21 12:35:03 -03:00
rafaelsideguide
af0e47a30e Merge remote-tracking branch 'origin/v1/node-sdk' into v1/python-sdk 2024-08-21 12:09:53 -03:00
rafaelsideguide
52abec41c2 fixing delete 2024-08-21 10:35:50 -03:00
Nicolas
db8c84ff0f Update requests.http 2024-08-21 10:19:37 -03:00
rafaelsideguide
b66553867e reverting delete, fixed express bug on checkCredits 2024-08-21 09:28:20 -03:00
rafaelsideguide
138437d616 commenting out delete, crashing on fire-engine
2024-08-21 08:11:24 -03:00
rafaelsideguide
5e48bec1fd commenting out delete, crashing on fire-engine 2024-08-21 08:10:46 -03:00
Nicolas
90b32f16c8 Nick: fixes 2024-08-20 21:38:11 -03:00
Nicolas
819ad50af3 Update fireEngine.ts 2024-08-20 21:16:33 -03:00
rafaelsideguide
e9d6ca197e tests passing now 2024-08-20 20:00:41 -03:00
Nicolas
1b3ad60a2c Reapply "Merge pull request #561 from mendableai/bug/dealing-with-dns-error"
This reverts commit ffe11a5bf7.
2024-08-20 19:22:09 -03:00
Nicolas
441628998f Reapply "Merge pull request #561 from mendableai/bug/dealing-with-dns-error"
This reverts commit ffe11a5bf7.
2024-08-20 19:16:48 -03:00
Nicolas
ffe11a5bf7 Revert "Merge pull request #561 from mendableai/bug/dealing-with-dns-error"
This reverts commit 2030ec6031, reversing
changes made to f494d2b707.
2024-08-20 18:16:11 -03:00
Gergő Móricz
5818236659 fix: remove rawHtml properly 2024-08-20 22:51:12 +02:00
rafaelsideguide
0f48823c9c Merge remote-tracking branch 'origin/v1-webscraper' into v1/node-sdk 2024-08-20 17:24:59 -03:00
rafaelsideguide
9d64c8eedc screenshot should not be undefined also 2024-08-20 17:24:45 -03:00
Gergő Móricz
1368f9a87f fix: treat existing screenshot as a scraper success condition 2024-08-20 22:24:18 +02:00
rafaelsideguide
70d81ca69e Merge remote-tracking branch 'origin/v1-webscraper' into v1/node-sdk 2024-08-20 17:08:07 -03:00
rafaelsideguide
537fc689b6 fixing request 2024-08-20 17:08:02 -03:00
rafaelsideguide
f98be7d94e Update fireEngine.ts 2024-08-20 16:53:01 -03:00
Nicolas
0c48c8a436 Nick: billing for map 2024-08-20 16:43:46 -03:00
Gergő Móricz
f494d2b707 prioritize search lower 2024-08-20 21:18:31 +02:00
Nicolas
59eb552df5 Merge pull request #560 from mendableai/feat/cancel-fire-engine
[Feat] Added check job and cancel to fire-engine requests
2024-08-20 15:43:31 -03:00
rafaelsideguide
1f27182a13 added try catch 2024-08-20 15:42:39 -03:00
Gergő Móricz
efb91f9ca5 fix(search): delete jobs after done 2024-08-20 20:29:08 +02:00
Nicolas
39388cdc35 Update crawl.ts 2024-08-20 14:41:43 -03:00
Nicolas
674adee144 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-20 14:41:05 -03:00
Nicolas
b36faeaf54 Nick: 2024-08-20 14:39:52 -03:00
Gergő Móricz
cf32893c2e add strict enforcement + move crawlerOptions to top-level in /crawl 2024-08-20 19:31:26 +02:00
rafaelsideguide
e326249a57 added check job and cancel to fire-engine requests 2024-08-20 14:26:42 -03:00
Gergő Móricz
70d50b3640 fix(queue-worker): move dotenv config up 2024-08-20 19:25:19 +02:00
Nicolas
c5ad4dedeb Update crawl.ts 2024-08-20 14:19:20 -03:00
Nicolas
de0dc20a02 Update credit_billing.ts 2024-08-20 14:18:14 -03:00
Nicolas
5abd26a267 Nick: set the crawl limit to the remaining credits 2024-08-20 14:16:54 -03:00
Nicolas
e200ec9e12 Nick: 2024-08-20 12:24:14 -03:00
Nicolas
55dad82df1 Nick: fixed map search 2024-08-20 12:17:53 -03:00
Nicolas
27903247b6 Nick: map tests and fixes 2024-08-20 12:04:08 -03:00
rafaelsideguide
fa89d2e535 v1 support for crawl/monitor status 2024-08-20 10:37:24 -03:00
rafaelsideguide
7727302ef1 Merge remote-tracking branch 'origin/v1-webscraper' into v1/node-sdk 2024-08-20 09:22:55 -03:00
rafaelsideguide
e160d5529a fixed test 2024-08-20 09:22:38 -03:00
rafaelsideguide
e1c9cbf709 bug fixed. crawl should not stop if sitemap url is invalid
2024-08-20 09:11:58 -03:00
rafaelsideguide
ecd472356b added variables to beta customers 2024-08-19 16:41:54 -03:00
Nicolas
3dc298be54 Nick: 2x rate limits for standard and growth for /scrape 2024-08-19 13:52:54 -03:00
rafaelsideguide
32aba44167 fixing merge issues 2024-08-19 13:37:20 -03:00
rafaelsideguide
72461ce9a6 Update index.test.ts 2024-08-19 13:29:52 -03:00
rafaelsideguide
fd7fdc1d52 added blocklist middleware 2024-08-19 13:28:54 -03:00