Commit Graph

1703 Commits

Author SHA1 Message Date
Gergő Móricz
d30356a22c fix(js-sdk): infer keyword collision 2024-09-12 19:57:33 +02:00
Gergő Móricz
d497284b40 feat(api/queue): auto-remove completed jobs after 25 hours 2024-09-12 19:47:22 +02:00
Nicolas
620b02f9ca Nick: 2024-09-12 12:51:14 -04:00
Gergő Móricz
a2903e75cf feat(js-sdk): type-safe LLM extract 2024-09-12 18:48:19 +02:00
Nicolas
e1171ade1b Merge branch 'main' into mog/fix-status-job-get 2024-09-12 11:35:36 -04:00
Nicolas
503c8b3efa Update package-lock.json 2024-09-12 11:35:26 -04:00
Nicolas
e19dc68718 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-09-11 14:59:40 -04:00
Nicolas
99c1af0a9f Update package.json 2024-09-11 14:59:36 -04:00
Gergő Móricz
5adfd74cc5 feat(js-sdk/test): add API_URL env var 2024-09-11 20:32:34 +02:00
Gergő Móricz
ad70c30be5 fix(js-sdk): check at bad if 2024-09-11 20:31:58 +02:00
Gergő Móricz
6e1cf2f40d feat(js-sdk): fixes, update tests 2024-09-11 20:15:43 +02:00
Nicolas
5791d3a0e5 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-09-11 14:03:36 -04:00
Nicolas
4cd1065ae2 Update rate-limiter.ts 2024-09-11 14:03:34 -04:00
Gergő Móricz
51da038405
Merge branch 'main' into fix-attw 2024-09-11 19:45:13 +02:00
Gergő Móricz
75f4bcd2ad
Merge pull request #614 from MonsterDeveloper/dependencies
perf(js-sdk): move `dotenv` and `uuid` to `devDependencies`, fix `zod` import
2024-09-11 19:34:50 +02:00
Andrei Bobkov
f6fc71b46a
fix(js-sdk): bring back cjs exports 2024-09-11 17:53:17 +03:00
Gergő Móricz
97ffabff3a fix(v1): converting bad docs always gives null 2024-09-10 21:21:20 +02:00
Gergő Móricz
83a165db0f fix(v0/scrape): ensure url is string 2024-09-10 21:18:53 +02:00
Gergő Móricz
ad1a6fbc74 fix(v1/map): handle invalid URLs gracefully 2024-09-10 19:41:01 +02:00
Gergő Móricz
e19f7a102e feat(js-sdk): paginate next on checkCrawlStatus + better types for CSR 2024-09-10 19:29:38 +02:00
Gergő Móricz
4ebc35c9dd fix(crawl-status): add success: true 2024-09-10 18:59:09 +02:00
rafaelsideguide
f855ad3436 bumping py-sdk version 2024-09-10 10:29:44 -03:00
rafaelsideguide
ee8a54213c fix(py-sdk): removed asyncio package
tested websocket with example.py without asyncio and it works with no problem.
2024-09-10 10:25:27 -03:00
rafaelsideguide
1074e976cd Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-09-10 09:09:43 -03:00
rafaelsideguide
45237a29dd updated js-sdk examples 2024-09-10 09:09:39 -03:00
Nicolas
a7406031fa Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-09-10 06:53:30 -03:00
Nicolas
a17e1cac92 Rate bump 2024-09-10 06:53:24 -03:00
Gergo Moricz
b4dbf75537 fix(v1): check if url is string in blocklistMiddleware
Fixes FIRECRAWL-SCRAPER-JS-9Z
2024-09-10 10:25:14 +02:00
Gergo Moricz
26f2095de6 fix(v1): proper Invalid URL handling 2024-09-10 09:24:23 +02:00
Gergo Moricz
f8fbc71f91 fix(supabase-jobs): do not use RPCs
RPCs are more failure-prone for this use case than regular queries are.
2024-09-10 09:20:18 +02:00
Gergo Moricz
a6bcf7b438 fix(v0/crawl-status): don't crash on big crawls when requesting jobs from supabase 2024-09-10 08:53:40 +02:00
Nicolas
17e419a7fb Nick: 2024-09-09 21:06:23 -03:00
Nicolas
ca9a781eb7 Update index.ts 2024-09-09 12:27:55 -03:00
Nicolas
22a5e85899 Update index.ts 2024-09-09 12:26:55 -03:00
Nicolas
60a15d00eb Update types.ts 2024-09-08 16:39:12 -03:00
Nicolas
fbdfa1256b Update credit_billing.ts 2024-09-08 13:07:10 -03:00
Nicolas
48c665519e Update credit_billing.ts 2024-09-07 13:42:45 -03:00
Nicolas
32097fa29d Merge branch 'main' of https://github.com/mendableai/firecrawl
Some checks failed
Fly Deploy / Pre-deploy checks (push) Has been cancelled
Clean Every 30 Minutes Before 24h Completed Jobs / clean-jobs (push) Has been cancelled
Fly Deploy Direct / Deploy app (push) Has been cancelled
Check Queues / clean-jobs (push) Has been cancelled
Fly Deploy / Test Suite (push) Has been cancelled
Fly Deploy / Python SDK Tests (push) Has been cancelled
Fly Deploy / JavaScript SDK Tests (push) Has been cancelled
Fly Deploy / Go SDK Tests (push) Has been cancelled
Fly Deploy / Rust SDK Tests (push) Has been cancelled
Fly Deploy / Deploy app (push) Has been cancelled
Fly Deploy / Build and publish Python SDK (push) Has been cancelled
Fly Deploy / Build and publish JavaScript SDK (push) Has been cancelled
Fly Deploy / Build and publish Rust SDK (push) Has been cancelled
2024-09-07 13:12:52 -03:00
Nicolas
5758af3291 Update website_params.ts 2024-09-07 13:12:46 -03:00
Nicolas
79870e7305 Update excludeTags.ts 2024-09-06 20:15:26 -03:00
Nicolas
a0f9ab2be7 Update map.ts 2024-09-06 20:14:47 -03:00
Nicolas
f5b84e15e1 Update sitemap.ts 2024-09-05 17:52:27 -03:00
rafaelsideguide
82d6bf4ec8 feat(go-parser): singleton 2024-09-05 16:14:21 -03:00
Nicolas
cb630bfc34 Update fireEngine.ts 2024-09-05 14:24:10 -03:00
rafaelsideguide
8c1097e9e1 fix: pageOptions 2024-09-05 14:16:31 -03:00
rafaelsideguide
b301ffc922 added missing variables 2024-09-05 13:57:26 -03:00
Nicolas
c6f1d80992 Update crawl.ts 2024-09-05 13:03:43 -03:00
Nicolas
eb03a81152 Update crawl-status.ts 2024-09-05 12:55:04 -03:00
Sebastjan Prachovskij
503e83e83e
Add SearchApi to search
Add support for engines, improve status code error

Remove changes in package, add engine to env params

Improve description in env example

Remove unnecessary empty line

Improve text
2024-09-05 18:36:59 +03:00
Nicolas
34adf43200 Merge branch 'main' into feat/go-html-to-md-parser 2024-09-05 12:13:14 -03:00
Nicolas
82cb80c817 Update map.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Blocked by required conditions
2024-09-04 23:46:18 -03:00
Nicolas
a0113dac37 Update credit_billing.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Blocked by required conditions
2024-09-04 16:54:20 -03:00
Nicolas
b7b99b53da Merge branch 'main' into nsc/check-credits-optimization 2024-09-04 16:39:18 -03:00
rafaelsideguide
85b824e122 test: what about false false? 2024-09-04 16:35:32 -03:00
rafaelsideguide
78edf13ec6 test: usedbauth envs wth 2024-09-04 16:31:42 -03:00
rafaelsideguide
cb8571abad fix: enforced dotenv config 2024-09-04 15:57:57 -03:00
Nicolas
5ecb243693 Nick: 2024-09-04 15:19:45 -03:00
rafaelsideguide
3f462eabe9 fix(cicd): 2024-09-04 12:27:46 -03:00
Nicolas
8fc313c1a4 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-09-04 12:09:36 -03:00
Nicolas
1eb993a93b Update __init__.py 2024-09-04 12:09:28 -03:00
rafaelsideguide
22bf67cc4f Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-09-04 12:05:09 -03:00
rafaelsideguide
364ba9f901 fix(cicd): mendable->firecrawl and waitfor 2024-09-04 12:05:02 -03:00
Nicolas
a2a63e42cc Rm print map 2024-09-04 12:03:58 -03:00
rafaelsideguide
ad950a6c9d fixed controller res and tests 2024-09-04 11:29:32 -03:00
rafaelsideguide
74ac8915cd details 2024-09-04 09:41:38 -03:00
rafaelsideguide
d836ba6782 added log to check response on cicd 2024-09-04 09:35:56 -03:00
Nicolas
049a11187d Nick: 2024-09-03 21:09:32 -03:00
Nicolas
3072d4a333 Nick: fixed .sort coupons and sentry to withAuth 2024-09-03 21:02:41 -03:00
Nicolas
7561fd279f Nick: debug the billing email system for free credits 2024-09-03 17:08:12 -03:00
rafaelsideguide
c5e1d77a82 added invalid html tests 2024-09-03 15:21:45 -03:00
rafaelsideguide
d60fa6e084 fixed dockerfile and function name. it's working 2024-09-03 14:08:07 -03:00
rafaelsideguide
ebf4035484 added log so we can check 2024-09-03 13:15:21 -03:00
rafaelsideguide
6ccc22ba2f fix(sdk): js next pagination
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Waiting to run
2024-09-03 11:57:19 -03:00
rafaelsideguide
411d7f31c5 fix(sdks): fetch next/pagination 2024-09-03 11:56:24 -03:00
rafaelsideguide
291d9e375b now using compiled go/C lib with koffi 2024-09-03 10:56:07 -03:00
Andrei Bobkov
2b0e447bc2
perf(js-sdk): move dotenv and uuid to devDependencies 2024-09-03 11:13:48 +02:00
Andrei Bobkov
2a8f55e533
perf(js-sdk): remove whole z import and instead use type-only import 2024-09-03 11:12:28 +02:00
Andrei Bobkov
fe8f9d4b2f
feat(js-sdk): drop commonjs outputs and simplify build process 2024-09-03 10:50:52 +02:00
Andrei Bobkov
995a3ff5bb
chore(tsconfig): modernize and remove commonjs 2024-09-03 10:49:59 +02:00
Nicolas
08a9cb8db4 Merge branch 'main' into pr/516 2024-09-02 23:32:23 -03:00
Nicolas
3244454826 Nick: 2024-09-02 19:36:36 -03:00
Nicolas
2444f7c450 Update scrape.ts 2024-09-02 18:51:18 -03:00
Nicolas
00dacaf935 Nick: scrape id when origin is website for report system 2024-09-02 18:43:52 -03:00
rafaelsideguide
48056ea1bd feat: added go html to md parser 2024-09-02 14:15:56 -03:00
rafaelsideguide
036eb925a2 fix(v1): maxDepth 2024-09-02 09:55:35 -03:00
Nicolas
86b0477708 Update map.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Waiting to run
2024-09-01 19:54:15 -03:00
Nicolas
304b3e752e Update map.ts 2024-09-01 19:52:21 -03:00
Nicolas
cb2dfe29be Nick: 2024-09-01 15:06:36 -03:00
Gergő Móricz
980293652d fix(queue-worker): new getJobs, log on v0 2024-09-01 19:29:35 +02:00
Nicolas
44fe741c35 Update queue-worker.ts 2024-09-01 14:19:43 -03:00
Nicolas
758f729ae2 Update queue-worker.ts 2024-09-01 14:10:47 -03:00
Nicolas
5c05bb12a7 Update webhook.ts 2024-09-01 13:58:50 -03:00
Nicolas
ae90370543 Update webhook.ts 2024-09-01 13:58:07 -03:00
Nicolas
0df2441d7f Nick: sdks good 2024-09-01 13:54:41 -03:00
Nicolas
b68a50fea3 Nick: 2024-09-01 13:52:34 -03:00
Nicolas
979697df1c Update queue-worker.ts 2024-09-01 13:47:51 -03:00
Nicolas
95b9dc915d Nick: webhooks v1 working great 2024-09-01 13:44:36 -03:00
Nicolas
faae98ecb8 Merge branch 'main' into v1/webhooks 2024-09-01 12:56:11 -03:00
Nicolas
fe6abe8f7d Nick: 2024-09-01 12:55:02 -03:00
Nicolas
2ef43d5fc2 Update rate-limiter.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Waiting to run
2024-08-31 14:26:16 -03:00
Nicolas
8431be5826 Nick: 2024-08-31 14:23:55 -03:00
Nicolas
af5cc5f16b Nick: 1.2 - v1 llm extract 2024-08-31 13:25:48 -03:00
Nicolas
0c59564393 Merge branch 'main' into v1/llm-extract 2024-08-30 21:31:49 -03:00
Gergő Móricz
693dc14d9b remove invalid keys
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Waiting to run
2024-08-31 00:08:24 +02:00
Nicolas
6d1da2e22a Update index.ts 2024-08-30 18:45:02 -03:00
Nicolas
d347160ff9 Nick: 2024-08-30 17:32:41 -03:00
Nicolas
522d256b11 Merge branch 'main' into v1/llm-extract 2024-08-30 17:16:21 -03:00
Nicolas
ef5c8931eb Update scrape.ts 2024-08-30 17:09:44 -03:00
Nicolas
050cac51ec Update types.ts 2024-08-30 17:02:29 -03:00
Nicolas
012bc74e14 Revert "Update types.ts"
This reverts commit 77766cf00d.
2024-08-30 17:02:05 -03:00
Nicolas
77766cf00d Update types.ts 2024-08-30 17:01:54 -03:00
rafaelsideguide
87e61f2d51 v0 working 2024-08-30 16:38:55 -03:00
rafaelsideguide
dc189e1e9d feat: webhooks config on v1 2024-08-30 16:22:59 -03:00
Nicolas
23dfe90baa Nick: 2024-08-30 15:43:31 -03:00
Nicolas
0732997a13 Merge branch 'main' into v1/llm-extract 2024-08-30 15:42:45 -03:00
Gergo Moricz
c8e0bb93ea fix(crawl-status): handle null data when purging rawHtml 2024-08-30 20:40:57 +02:00
Nicolas
4f92bb4bf4 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-08-30 15:29:47 -03:00
Nicolas
282962e36f Nick: 2024-08-30 15:29:41 -03:00
Gergo Moricz
6a6b487474 fix(v1): don't fail on doc = null 2024-08-30 20:28:18 +02:00
Nicolas
71dab56e36 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-08-30 15:21:29 -03:00
Nicolas
d7dbc2536d Update crawl.ts 2024-08-30 15:21:22 -03:00
Nicolas
9de39cfa2c
Merge pull request #591 from mendableai/mog/websocket-sdk
[v1] Websockets SDKs
2024-08-30 15:06:06 -03:00
Nicolas
234c6daee8 Update supabase-jobs.ts 2024-08-30 14:52:59 -03:00
Nicolas
7c8ca0c889 Update types.ts 2024-08-30 13:54:04 -03:00
Nicolas
f4a49bc1d1 Update types.ts 2024-08-30 13:49:16 -03:00
Nicolas
a2881e9288 Nick: llm extract support on node sdk 2024-08-30 13:43:19 -03:00
Nicolas
45e33563eb Nick: python working 2024-08-30 12:58:38 -03:00
Nicolas
bb4808443c Merge branch 'main' into v1/llm-extract 2024-08-30 12:48:26 -03:00
Nicolas
2fd9c285bd
Merge pull request #582 from mendableai/fix--rawHtml-return-on-crawl
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Waiting to run
[Bug] Moved delete rawHtml to end of controller
2024-08-30 12:40:06 -03:00
Nicolas
b8920d6f4a Nick: 2024-08-30 12:37:45 -03:00
Nicolas
552328d168 Merge branch 'main' into v1/llm-extract 2024-08-30 12:34:50 -03:00
Nicolas
ee3e5dc69c Nick: 2024-08-30 12:34:45 -03:00
Gergő Móricz
26d87bd099
fix(api): handle zoderrors earlier 2024-08-30 17:33:42 +02:00
Nicolas
1805d901a9 Update credit_billing.ts 2024-08-30 12:05:37 -03:00
Nicolas
e5ca4364ba Nick: improvements to llm extract error handling 2024-08-30 11:57:55 -03:00
Nicolas
52ac132328
Update auth.ts 2024-08-30 11:10:48 -03:00
Rafael Miller
636e39d366
Merge branch 'main' into mog/websocket-sdk 2024-08-30 11:09:09 -03:00
rafaelsideguide
ae38c26fa8 feat(v1-sdks): async crawl node, python websocket + async crawl + example 2024-08-30 10:09:39 -03:00
Eric Ciarla
7caaee28c5 Update ui component to v1 2024-08-29 21:40:44 -04:00
Nicolas
41eb620959 Nick: prompt option, still need to convert to new structured outputs 2024-08-29 21:00:57 -03:00
Nicolas
49e1cb7ca0 Nick: 2024-08-29 20:08:06 -03:00
Nicolas
63264644e1 Nick: fixed js-sdk map params
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Rust SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Fly Deploy / Build and publish Rust SDK (push) Waiting to run
2024-08-29 20:06:55 -03:00
rafaelsideguide
377e8ded34 removed v0 support 2024-08-29 17:40:43 -03:00
rafaelsideguide
5f11275fe7 Merge remote-tracking branch 'origin/main' into mog/websocket-sdk 2024-08-29 17:14:59 -03:00
Nicolas
db85f1a7d5 Update index.ts 2024-08-29 17:11:16 -03:00
Rafael Miller
eec6d86802
Merge branch 'main' into f/rust-sdk 2024-08-29 16:20:03 -03:00
Gergő Móricz
53018a683f feat(js-sdk): add crawlUrlAndWatch 2024-08-29 20:01:16 +02:00
Gergő Móricz
d4001e4528 fix(api/websocket): fix auth and termination 2024-08-29 20:01:00 +02:00
Nicolas
1174bc3c82 Nick: blocked sw/ra
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-29 13:55:54 -03:00
rafaelsideguide
7eefaffe48 fix: rawHtml for v0 2024-08-29 12:24:14 -03:00
rafaelsideguide
f8323f9b11 fix: moved delete rawHtml to end of controller 2024-08-29 10:02:05 -03:00
Nicolas
02410aca36 Update runWebScraper.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-28 22:39:50 -03:00
Nicolas
67229c6b3a Nick: fixed credits issue
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-28 18:32:45 -03:00
rafaelsideguide
c3158b0f98 fix(v1): js-sdk fixed crawl type 2024-08-28 16:27:29 -03:00
Nicolas
70bff7f8fb Nick: 2024-08-28 15:59:20 -03:00
Nicolas
8be75accb8 Nick: 2024-08-28 15:40:30 -03:00
rafaelsideguide
c7b3365ffd fix(v1): update readme - v1.0.1 2024-08-28 15:15:29 -03:00
Nicolas
547da88ccd Update rate-limiter.test.ts 2024-08-28 14:53:02 -03:00
Nicolas
4edd9a34c7 Nick: totalCount -> total, completedCount -> completed 2024-08-28 14:46:53 -03:00
Nicolas
2a56f89f50 Update index.test.ts 2024-08-28 14:24:26 -03:00
Nicolas
59547d7129 Update index.test.ts 2024-08-28 14:22:26 -03:00
rafaelsideguide
b7f62f93c4 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-28 14:18:07 -03:00
rafaelsideguide
1c35534e2a fix(v1): check-status for preview 2024-08-28 14:18:05 -03:00
Nicolas
af67b14c9d Nick: 2024-08-28 14:17:59 -03:00
Nicolas
8fbff28009 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-28 14:09:20 -03:00
Nicolas
08387c06f8 Update auth.ts 2024-08-28 14:09:12 -03:00
rafaelsideguide
ef2d8d012b Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-28 14:07:31 -03:00
rafaelsideguide
5cbf0dcaf5 fix(v1): includeTags 2024-08-28 14:07:28 -03:00
Nicolas
7ba3f3bd1a Nick: 2024-08-28 13:54:39 -03:00
Nicolas
9a13034095 Nick: 2024-08-28 13:46:39 -03:00
Nicolas
ecd07be49e Nick: fixed issues 2024-08-28 13:17:22 -03:00
Nicolas
4003d37fbc Nick: 2024-08-28 12:46:59 -03:00
Nicolas
d872bf0c4c Merge branch 'main' into v1-webscraper 2024-08-28 12:42:23 -03:00
Nicolas
9a43c6cda1 Nick: 2024-08-27 20:02:50 -03:00
Nicolas
7dff5cdf49 Nick: fixed sdk types and map preview 2024-08-27 20:02:39 -03:00
Nicolas
38ed845b93 Nick: fixed v0 backwards compatibility node sdk types 2024-08-27 19:10:58 -03:00
Nicolas
e7d283c4ef Update job-priority.ts 2024-08-27 17:04:04 -03:00
Nicolas
ca34f1203b Nick: bucket limit increase 2024-08-27 17:03:46 -03:00
Nicolas
f0dfcd6a49 Update job-priority.ts 2024-08-27 16:58:28 -03:00
Nicolas
080240e040 Merge branch 'main' into nsc/job-priority 2024-08-27 16:18:02 -03:00
Nicolas
88239d7211 Merge branch 'main' into v1-webscraper 2024-08-27 16:16:40 -03:00
Nicolas
8c8d0602bb Update rate-limiter.test.ts 2024-08-27 15:38:01 -03:00
Nicolas
bd84e49a77 Update rate-limiter.ts 2024-08-27 15:37:36 -03:00
Nicolas
a0f9a81ee6 Nick: 2024-08-27 15:36:46 -03:00
Nicolas
c009013ff6 Nick: expire tests 2024-08-27 15:26:43 -03:00
rafaelsideguide
43f6c0a10f Update example.ts 2024-08-27 15:08:29 -03:00
Nicolas
8b53285a93 Update job-priority.ts 2024-08-27 15:06:13 -03:00
Nicolas
06b70a47e0 Update job-priority.ts 2024-08-27 15:04:10 -03:00
Nicolas
1e08e6d317 Merge branch 'main' into nsc/job-priority 2024-08-27 15:01:58 -03:00
Nicolas
861e2ebdf1 Nick: 2x rate limits
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-27 12:08:50 -03:00
Nicolas
170a8ebfe5 Nick: 2024-08-27 11:58:42 -03:00
Nicolas
3850b000f0 Nick: removing credit notification for now 2024-08-27 11:14:04 -03:00
Nicolas
d30119707f Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-27 11:12:00 -03:00
Nicolas
463d38f77e Update map.ts 2024-08-27 11:11:59 -03:00
rafaelsideguide
1ef41b92a0 feat: cancel
v0 implementation + e2e test
2024-08-27 09:42:55 -03:00
Nicolas
fb553a020d Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 19:57:28 -03:00
Nicolas
6ab6ef9004 Update auth.ts 2024-08-26 19:57:27 -03:00
rafaelsideguide
adc3e4233d Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 19:22:05 -03:00
rafaelsideguide
65faa3e163 tests/feat: url validation 2024-08-26 19:22:03 -03:00
Nicolas
558acffb33 Nick: @rafaelsideguide isarray for includes/excludes 2024-08-26 19:07:14 -03:00
Nicolas
7d93eab0f8 Nick: 2024-08-26 18:48:00 -03:00
rafaelsideguide
72454de18d Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 18:21:54 -03:00
rafaelsideguide
04556ded40 tests: e2e for crawl and crawl status 2024-08-26 18:21:52 -03:00
Nicolas
8c37ea6d96 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-26 18:17:17 -03:00
Nicolas
f277a0e2bb Update package.json 2024-08-26 18:17:09 -03:00
rafaelsideguide
f2f6f78dcf fix(url validation): sub paths 2024-08-26 18:12:03 -03:00
Nicolas
0bbb8bb24e Nick: 2024-08-26 17:17:12 -03:00
Nicolas
98a770f38f Nick: rm wip 2024-08-26 17:16:44 -03:00
Nicolas
6f68678b5d Nick: 2024-08-26 17:13:00 -03:00
Nicolas
b0bd71a3a9 Merge branch 'main' into v1-webscraper 2024-08-26 16:58:00 -03:00
Nicolas
2d78c20d68 Nick: 2024-08-26 16:56:27 -03:00
Nicolas
fa7dc5b10b Update rate-limiter.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-26 16:33:34 -03:00
Nicolas
4d0acc9722 Merge branch 'main' into v1-webscraper 2024-08-26 16:22:05 -03:00
Nicolas
5606fe5870 Nick: 2024-08-26 16:05:11 -03:00
rafaelsideguide
1baba3ce0a fix(go-sdk): submodules 2024-08-26 11:11:34 -03:00
Gergo Moricz
d591e0f51c block corterix.com for performance issues
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-25 20:06:12 +02:00
rafaelsideguide
6f9a2687ae fixed turndown bug 2024-08-25 15:04:32 -03:00
Gergo Moricz
96e91ab9ec convert webhook call to v1 2024-08-25 14:05:46 +02:00
Nicolas
1f99bfd3c8 Update queue.ts
Some checks failed
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Simple Autoscaler / scale (push) Has been cancelled
2024-08-23 22:47:12 -03:00
Nicolas
b80277d4de Update queue.ts 2024-08-23 22:46:44 -03:00
Nicolas
d87b62fed9 Nick: 2024-08-23 22:33:17 -03:00
Nicolas
b9e06e27f4 Update queue.ts 2024-08-23 22:17:27 -03:00
Nicolas
8e78511ed4 Update queue.ts 2024-08-23 22:15:47 -03:00
Nicolas
28d7a637c2 Update queue.ts 2024-08-23 22:07:49 -03:00
Nicolas
173f4ee1bf Nick: chrome cdp main | simple autoscaler 2024-08-23 20:09:59 -03:00
Gergő Móricz
064ebfc54d fix websocket 2024-08-23 19:55:41 +02:00
Gergő Móricz
05c250d3b8 Merge branch 'main' into v1-webscraper 2024-08-23 19:38:57 +02:00
Gergő Móricz
2ab0dd2e15 fix(scrape): add further llm extraction catch 2024-08-23 19:20:17 +02:00
Gergő Móricz
1054a1397b Merge branch 'main' into v1-webscraper 2024-08-23 19:14:49 +02:00
Nicolas
3d53f4e213 Nick: unblocking pin 2024-08-23 13:56:05 -03:00
Gergő Móricz
5ef3926d2a fix(scrape,search): handle failed jobs 2024-08-23 18:47:56 +02:00
Gergő Móricz
866e71910c further fixes 2024-08-23 18:27:00 +02:00
Gergő Móricz
eea530e0ad feat(v1): update for sentry 2024-08-23 17:29:42 +02:00
Gergő Móricz
e7f267b6fe Merge branch 'main' into v1-webscraper 2024-08-23 17:21:54 +02:00
Gergő Móricz
52a05b8c6e rename "dragonfly" to "redis" 2024-08-23 17:05:59 +02:00
Gergő Móricz
64e9be0cd4 feat(redis): use bitnami image
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-22 23:38:04 +02:00
Gergő Móricz
8d9ff90bcb feat(fire-engine): propagate sentry trace 2024-08-22 23:38:04 +02:00
rafaelsideguide
74ea820bc6 fix: url and check for metadata 2024-08-22 18:32:19 -03:00
Nicolas
1f0abacadf Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-08-22 18:30:54 -03:00
Nicolas
1f779e261a Update rate-limiter.ts 2024-08-22 18:30:45 -03:00
Gergő Móricz
8e3c2b2855 fix(crawler): verify URL 2024-08-22 23:30:19 +02:00
Gergő Móricz
e690a6fda7 fix: remove QueueEvents 2024-08-22 22:38:39 +02:00
Gergő Móricz
76c8e9f996 fix
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-22 22:24:24 +02:00
Gergő Móricz
ad82175fb8 fix(scrape): poll 2024-08-22 22:12:02 +02:00
rafaelsideguide
5f60a55967 workflow and npm now running v1 tests 2024-08-22 15:28:49 -03:00
rafaelsideguide
30e809966f Merge remote-tracking branch 'origin/v1/python-sdk' into v1-webscraper 2024-08-22 15:18:05 -03:00
rafaelsideguide
a37681bdff fix: replace jest, removed map for v0 2024-08-22 15:16:46 -03:00
rafaelsideguide
7473b74021 fix: html and rawlhtmls for pdfs 2024-08-22 15:15:45 -03:00
Gergő Móricz
dd737f1235 feat(sentry): add queue instrumentation to 2024-08-22 19:17:51 +02:00
Nicolas
d2521612b4 Update .gitignore 2024-08-22 14:15:19 -03:00
Gergő Móricz
7265ab7c67 fix(search): filter docs properly 2024-08-22 18:46:56 +02:00
rafaelsideguide
b1d61d8557 Merge remote-tracking branch 'origin/v1-webscraper' into v1/python-sdk 2024-08-22 13:39:09 -03:00
rafaelsideguide
ab88a75c70 fixes sdks 2024-08-22 13:38:34 -03:00
Gergő Móricz
d036738da0 fix(bullmq): duplicate redis connection for QueueEvents 2024-08-22 18:04:09 +02:00
Gergő Móricz
6d48dbcd38 feat(sentry): add trace continuity for queue 2024-08-22 16:47:38 +02:00
Gergő Móricz
6d92b8524d feat(scrape): record job result in span 2024-08-22 16:00:13 +02:00
Gergő Móricz
5ca36fe9fc feat(api): add more captureExceptions 2024-08-22 15:49:16 +02:00
Gergő Móricz
0e8fd6ce70 fix(scrape): ensure extractionSchema is an object if llm-extraction is specified 2024-08-22 14:50:51 +02:00
Gergő Móricz
4bd2ff26d3 fix(llm-extract): pass stacktrace properly 2024-08-22 14:37:09 +02:00
Gergő Móricz
e4adbaa88e fix(llm-extract): handle llm-extract if scrape failed
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-22 14:12:52 +02:00
Gergő Móricz
670d253a8c fix(auth): fix error reporting 2024-08-22 14:08:09 +02:00
Gergő Móricz
7d9f5bf8b1 fix(crawl): don't use sitemap if it's empty
Fixes FIRECRAWL-SCRAPER-JS-11
2024-08-22 13:41:33 +02:00
Gergő Móricz
1f580deefc fix(crawl): validate includes.excludes regexes 2024-08-22 13:29:11 +02:00
Gergő Móricz
fbbc3878f1 fix(crawler): make sure includes/excludes is an array 2024-08-22 13:18:26 +02:00
Gergő Móricz
508568f943 fix(search): handle scrape timeouts on search
Fixes FIRECRAWL-SCRAPER-JS-15
2024-08-22 13:10:58 +02:00
Gergő Móricz
14fa75cae6 fix(crawl): send error if url is not a string
Fixes FIRECRAWL-SCRAPER-JS-1E and FIRECRAWL-SCRAPER-JS-Z
2024-08-22 13:09:08 +02:00
Nicolas
8a778278a9 Merge branch 'main' into nsc/job-priority 2024-08-21 22:57:55 -03:00
Gergo Moricz
0cdf41587e feat(sentry): add error handles to try-catch blocks
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-22 03:55:40 +02:00
Nicolas
53ca704620 Update index.ts 2024-08-21 22:55:39 -03:00
Nicolas
477c3257dc Nick: 2024-08-21 22:53:33 -03:00
Nicolas
c7bfe4ffe8 Nick: 2024-08-21 22:20:40 -03:00
Nicolas
6bdb1d045d Merge branch 'main' into nsc/job-priority 2024-08-21 21:52:05 -03:00
Nicolas
e78d2af1f0 Nick: 2024-08-21 21:51:54 -03:00
Nicolas
e64d3815ea Merge branch 'main' into nsc/job-priority 2024-08-21 20:54:57 -03:00
Nicolas
0ea0a5db46 Nick: wip 2024-08-21 20:54:39 -03:00
rafaelsideguide
a4686e3c8c fixing tests 2024-08-21 15:56:48 -03:00
rafaelsideguide
fe2e8c0b7a includehtml fix 2024-08-21 15:54:00 -03:00
Gergő Móricz
629da74a5c fix(sentry): decrease tracesSampleRate
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-21 20:51:35 +02:00
Gergő Móricz
55009e51f5 fix: filter out invalid URLs from crawl links 2024-08-21 20:49:25 +02:00
Gergő Móricz
dae1408e66 fix(Dockerfile): retain sentry auth token properly 2024-08-21 20:40:42 +02:00
Gergő Móricz
ac9783ed2f fix(sentry): adjust profiles sample rate to be even lower 2024-08-21 20:21:16 +02:00
Gergő Móricz
9579f03c4b fix: import resolution 2024-08-21 20:16:06 +02:00
Gergő Móricz
6104d74213 fix(sentry): drop profiling sample rate 2024-08-21 20:12:47 +02:00
Gergő Móricz
3d5dc9d90a feat(sentry): add log + server name 2024-08-21 19:39:10 +02:00
Gergő Móricz
85ff0c311e Add worker ID to job attribute 2024-08-21 19:21:29 +02:00
Gergő Móricz
920702cdde Update builder to handle uploading sourcemaps 2024-08-21 19:08:03 +02:00
Gergő Móricz
86942728e3 Add metadata for queue-worker and Express 2024-08-21 17:58:27 +02:00
Nicolas
35decb1af2 Nick: 2024-08-21 12:35:03 -03:00
rafaelsideguide
af0e47a30e Merge remote-tracking branch 'origin/v1/node-sdk' into v1/python-sdk 2024-08-21 12:09:53 -03:00
rafaelsideguide
52abec41c2 fixing delete 2024-08-21 10:35:50 -03:00
Nicolas
db8c84ff0f Update requests.http 2024-08-21 10:19:37 -03:00
rafaelsideguide
b66553867e reverting delete, fixed express bug on checkCredits 2024-08-21 09:28:20 -03:00
rafaelsideguide
138437d616 commenting out delete, crashing on fire-engine
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-21 08:11:24 -03:00
rafaelsideguide
5e48bec1fd commenting out delete, crashing on fire-engine 2024-08-21 08:10:46 -03:00
Nicolas
90b32f16c8 Nick: fixes 2024-08-20 21:38:11 -03:00
Nicolas
819ad50af3 Update fireEngine.ts 2024-08-20 21:16:33 -03:00
rafaelsideguide
e9d6ca197e tests passing now 2024-08-20 20:00:41 -03:00
Nicolas
1b3ad60a2c Reapply "Merge pull request #561 from mendableai/bug/dealing-with-dns-error"
This reverts commit ffe11a5bf7.
2024-08-20 19:22:09 -03:00
Nicolas
441628998f Reapply "Merge pull request #561 from mendableai/bug/dealing-with-dns-error"
This reverts commit ffe11a5bf7.
2024-08-20 19:16:48 -03:00
Nicolas
ffe11a5bf7 Revert "Merge pull request #561 from mendableai/bug/dealing-with-dns-error"
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
This reverts commit 2030ec6031, reversing
changes made to f494d2b707.
2024-08-20 18:16:11 -03:00
Gergő Móricz
5818236659 fix: remove rawHtml properly 2024-08-20 22:51:12 +02:00
rafaelsideguide
0f48823c9c Merge remote-tracking branch 'origin/v1-webscraper' into v1/node-sdk 2024-08-20 17:24:59 -03:00
rafaelsideguide
9d64c8eedc screenshot should not be undefined also 2024-08-20 17:24:45 -03:00
Gergő Móricz
1368f9a87f fix: treat existing screenshot as a scraper success condition 2024-08-20 22:24:18 +02:00
rafaelsideguide
70d81ca69e Merge remote-tracking branch 'origin/v1-webscraper' into v1/node-sdk 2024-08-20 17:08:07 -03:00
rafaelsideguide
537fc689b6 fixing request 2024-08-20 17:08:02 -03:00
rafaelsideguide
f98be7d94e Update fireEngine.ts 2024-08-20 16:53:01 -03:00
Nicolas
0c48c8a436 Nick: billing for map 2024-08-20 16:43:46 -03:00
Gergő Móricz
f494d2b707 prioritize search lower 2024-08-20 21:18:31 +02:00
Nicolas
59eb552df5
Merge pull request #560 from mendableai/feat/cancel-fire-engine
[Feat] Added check job and cancel to fire-engine requests
2024-08-20 15:43:31 -03:00
rafaelsideguide
1f27182a13 added try catch 2024-08-20 15:42:39 -03:00
Gergő Móricz
efb91f9ca5 fix(search): delete jobs after done 2024-08-20 20:29:08 +02:00
Nicolas
39388cdc35 Update crawl.ts 2024-08-20 14:41:43 -03:00
Nicolas
674adee144 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-20 14:41:05 -03:00
Nicolas
b36faeaf54 Nick: 2024-08-20 14:39:52 -03:00
Gergő Móricz
cf32893c2e add strict enforcement + move crawlerOptions to top-level in /crawl 2024-08-20 19:31:26 +02:00
rafaelsideguide
e326249a57 added check job and cancel to fire-engine requests 2024-08-20 14:26:42 -03:00
Gergő Móricz
70d50b3640 fix(queue-worker): move dotenv config up 2024-08-20 19:25:19 +02:00
Nicolas
c5ad4dedeb Update crawl.ts 2024-08-20 14:19:20 -03:00
Nicolas
de0dc20a02 Update credit_billing.ts 2024-08-20 14:18:14 -03:00
Nicolas
5abd26a267 Nick: set the crawl limit to the remaining credits 2024-08-20 14:16:54 -03:00
Nicolas
e200ec9e12 Nick: 2024-08-20 12:24:14 -03:00
Nicolas
55dad82df1 Nick: fixed map search 2024-08-20 12:17:53 -03:00
Nicolas
27903247b6 Nick: map tests and fixes 2024-08-20 12:04:08 -03:00
rafaelsideguide
fa89d2e535 v1 support for crawl/monitor status 2024-08-20 10:37:24 -03:00
rafaelsideguide
7727302ef1 Merge remote-tracking branch 'origin/v1-webscraper' into v1/node-sdk 2024-08-20 09:22:55 -03:00
rafaelsideguide
e160d5529a fixed test 2024-08-20 09:22:38 -03:00
rafaelsideguide
e1c9cbf709 bug fixed. crawl should not stop if sitemap url is invalid
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-20 09:11:58 -03:00
rafaelsideguide
ecd472356b added variables to beta customers 2024-08-19 16:41:54 -03:00
Nicolas
3dc298be54 Nick: 2x rate limits for standard and growth for /scrape 2024-08-19 13:52:54 -03:00
rafaelsideguide
32aba44167 fixing merge issues 2024-08-19 13:37:20 -03:00
rafaelsideguide
72461ce9a6 Update index.test.ts 2024-08-19 13:29:52 -03:00
rafaelsideguide
fd7fdc1d52 added blocklist middleware 2024-08-19 13:28:54 -03:00
rafaelsideguide
e516e4993b Merge remote-tracking branch 'origin/v1-webscraper' into v1/node-sdk 2024-08-19 11:24:57 -03:00
Nicolas
8e4ca86463 Update crawl.ts 2024-08-19 11:02:24 -03:00
Nicolas
36b35dbc67 Update crawl.ts 2024-08-19 11:01:26 -03:00
rafaelsideguide
4ffc60596a Update queue-worker.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-19 09:29:23 -03:00
rafaelsideguide
b8170aaa47 Update blocklist.ts 2024-08-19 08:51:48 -03:00
Nicolas
3fe82b4f12 Update queue-worker.ts
Some checks failed
Fly Deploy / Pre-deploy checks (push) Has been cancelled
Fly Deploy / Test Suite (push) Has been cancelled
Fly Deploy / Python SDK Tests (push) Has been cancelled
Fly Deploy / JavaScript SDK Tests (push) Has been cancelled
Fly Deploy / Go SDK Tests (push) Has been cancelled
Fly Deploy / Deploy app (push) Has been cancelled
Fly Deploy / Build and publish Python SDK (push) Has been cancelled
Fly Deploy / Build and publish JavaScript SDK (push) Has been cancelled
2024-08-17 03:09:31 -04:00
Nicolas
f797380112 Nick:
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-16 22:17:38 -04:00
Nicolas
47123be783 Nick: weird activity block 2024-08-16 22:01:56 -04:00
Nicolas
ff84f1fe5e Update map.ts 2024-08-16 20:42:36 -04:00
Nicolas
4314313477 Update map.ts 2024-08-16 19:56:18 -04:00
Nicolas
af9a0a6f0b Update map.ts 2024-08-16 19:56:03 -04:00
Nicolas
ba5279eafc Nick: all tests passing 2024-08-16 19:55:44 -04:00
Nicolas
5205c5f005 Update map.ts 2024-08-16 19:37:00 -04:00
Nicolas
0c05d096a9 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-16 19:33:58 -04:00
Nicolas
ab48353226 Nick: /map almost good 2024-08-16 19:33:57 -04:00
Gergő Móricz
eb84673b06 feat: crawl status websocket WIP 2024-08-17 01:04:14 +02:00
Gergő Móricz
e2a6ef26d3 mount v1Router under v1 path 2024-08-16 23:48:50 +02:00
Gergő Móricz
4c1b74dab3 fix(map): remove robots.txt 2024-08-16 23:46:10 +02:00
Gergő Móricz
c281fe62c0 fix(crawl): propagate db fix to preview endpoint 2024-08-16 23:43:54 +02:00
Gergő Móricz
803577eeba feat(crawl): webhook 2024-08-16 23:42:48 +02:00
Gergő Móricz
e6738abf96 fix(crawl-status): retrieve from DB in bulk 2024-08-16 23:39:39 +02:00
rafaelsideguide
086ba6280b fixed markdown format 2024-08-16 18:39:13 -03:00
Gergő Móricz
aabfaf0ac5 clean up crawl-status, fix db ddos 2024-08-16 23:29:39 +02:00
rafaelsideguide
e5b807ccc4 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-16 17:57:31 -03:00
rafaelsideguide
7a61325500 map + search + scrape markdown bug 2024-08-16 17:57:11 -03:00
Gergő Móricz
5896153d19 fix: crawl status and redis fixes 2024-08-16 22:52:48 +02:00
Gergő Móricz
3fcb21930e remove log 2024-08-16 22:48:23 +02:00
Gergő Móricz
f20328bdbb crawl status and document stuff 2024-08-16 22:48:05 +02:00
Nicolas
0c057bb649 Update index.test.ts 2024-08-16 16:45:10 -04:00
Nicolas
b32464558a Update index.test.ts 2024-08-16 16:41:09 -04:00
Nicolas
5bac7988a6 Update index.test.ts 2024-08-16 16:08:38 -04:00
Nicolas
290c7ee936 Update index.test.ts 2024-08-16 16:06:46 -04:00
Nicolas
23a033fe61 Nick: fixes and more e2e tests 2024-08-16 16:03:35 -04:00
Nicolas
78ca94251c
Merge pull request #480 from mendableai/nsc/hyper-v81
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
Reduce metrics ingestion w/ HyperDX v0.8.1
2024-08-16 14:34:14 -04:00
Nicolas
37ae9a9043 Update index.test.ts 2024-08-16 14:17:43 -04:00
Nicolas
200ce8e2ce Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-16 14:16:35 -04:00
Nicolas
21d3798e49 Nick: initial e2e v1 tests for /scrape 2024-08-16 14:16:30 -04:00
rafaelsideguide
3f998b688d scrape ready 2024-08-16 15:14:37 -03:00
Nicolas
b0d211ecc1 Merge branch 'main' into v1-webscraper 2024-08-16 13:43:28 -04:00
Gergő Móricz
fd6432e7fd fix(queue-worker): correct job success 2024-08-16 19:16:08 +02:00
Gergő Móricz
6e54942265 fix(queue-worker): add cancelled to crawl log 2024-08-16 19:11:53 +02:00
rafaelsideguide
9b1cb266a0 added origin to request types 2024-08-16 13:49:50 -03:00
Gergő Móricz
d0a8382a5b fix(queue-worker): crawl finishing race condition 2024-08-16 18:48:52 +02:00
Gergő Móricz
6bd52e63bf fix(queue-worker): fix linksOnPage undefined error 2024-08-16 18:42:24 +02:00
Gergő Móricz
5a6570cba2 fix(webhooks): call back with parent crawl ID 2024-08-16 17:42:42 +02:00
rafaelsideguide
7d324bd2c8 Create checkCredits.ts 2024-08-16 11:21:52 -03:00
Nicolas
ec361609d2 Nick: added growth-2x plan
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-15 18:37:19 -04:00
Gergő Móricz
8b7569f8f3 add zod, create middleware, update openapi declaration, add crawl logic 2024-08-15 23:30:33 +02:00
Gergő Móricz
4165de1773 v1 restructure 2024-08-15 21:51:59 +02:00
Gergő Móricz
af08ab0b1a fix bad module resolution 2024-08-15 21:51:52 +02:00
Nicolas
c917c8fbcd Merge branch 'main' into v1-webscraper 2024-08-15 15:14:29 -04:00
Nicolas
32c6b1f136 Nick: remove active job alerts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-15 14:50:30 -04:00
Gergő Móricz
0c14366720 fix: add checkandupdateurl to crawlPreview 2024-08-15 20:30:25 +02:00
Nicolas
81b2479db3
Merge pull request #459 from mendableai/feat/queue-scrapes
feat: Move scraper to queue
2024-08-15 14:19:55 -04:00
Gergő Móricz
fc08ff450d search port 2024-08-15 20:10:59 +02:00
Nicolas
86326f34e9 Update single_url.test.ts 2024-08-15 13:48:42 -04:00
Gergő Móricz
129a882bcc fix(scrape): give scrapes their real job id 2024-08-15 19:29:47 +02:00
Gergő Móricz
965a5817d1 fix(queue-worker): log jobs correctly 2024-08-15 19:27:15 +02:00
Gergő Móricz
dad9d353d9 use thomas's url validation 2024-08-15 19:19:02 +02:00
Gergő Móricz
e3279274f1 fix: make playground crawl work 2024-08-15 19:14:32 +02:00
Gergő Móricz
c5597bc722 fix: robots.txt laoding 2024-08-15 19:11:07 +02:00
Gergő Móricz
29f0d9ec94 propagate priority to fire-engine 2024-08-15 19:04:46 +02:00
Gergő Móricz
b79d3d1754 fix 2024-08-15 19:02:05 +02:00
Gergő Móricz
57730f6a35 priority changes 2024-08-15 18:58:07 +02:00
Gergő Móricz
846610681b fix: fix posthog, add dummy crawl DB items 2024-08-15 18:55:18 +02:00
rafaelsideguide
81066cf90a updating cargo pckg name n version 2024-08-15 10:11:27 -03:00
Nicolas
6e1074cdd1 Update website_params.ts
Some checks are pending
Fly Deploy / Pre-deploy checks (push) Waiting to run
Fly Deploy / Test Suite (push) Blocked by required conditions
Fly Deploy / Python SDK Tests (push) Blocked by required conditions
Fly Deploy / JavaScript SDK Tests (push) Blocked by required conditions
Fly Deploy / Go SDK Tests (push) Blocked by required conditions
Fly Deploy / Deploy app (push) Blocked by required conditions
Fly Deploy / Build and publish Python SDK (push) Blocked by required conditions
Fly Deploy / Build and publish JavaScript SDK (push) Blocked by required conditions
2024-08-14 17:39:54 -04:00