Commit Graph

292 Commits

Author SHA1 Message Date
intergalacticalvariable
22b58be001
Update README.md 2024-09-29 12:40:29 +02:00
intergalacticalvariable
0310b00df9
Update README.md 2024-09-29 12:35:12 +02:00
intergalacticalvariable
5e947f4cfd
Update README.md 2024-09-29 12:20:25 +02:00
intergalacticalvariable
3ecb275e35
Update README.md 2024-09-28 12:51:20 +02:00
intergalacticalvariable
20945d733d
Update README.md 2024-09-28 10:54:28 +02:00
intergalacticalvariable
e1d45875fe
Update README.md 2024-09-28 10:49:56 +02:00
intergalacticalvariable
a846c24307
Update README.md 2024-09-28 10:47:36 +02:00
intergalacticalvariable
1e2a35e04e Updated README 2024-09-28 08:45:57 +00:00
intergalacticalvariable
7debf71cb8 Updated Readme 2024-09-28 08:39:52 +00:00
Generic Developer
4f5d1b519a updated Readme 2024-09-28 08:15:56 +00:00
Generic Developer
9b0a0b91b8 works ;) 2024-09-28 02:31:42 +00:00
Generic Developer
025c8b67b0 works with urls 2024-09-28 02:01:01 +00:00
Generic Developer
1bcfead104 screenshots allmost work 2024-09-28 01:10:17 +00:00
Generic Developer
3d3863f369 docker markdown working 2024-09-27 23:56:51 +00:00
Harsh Gupta
66e5d42d2e cleanup 2024-09-18 09:48:30 +05:30
Harsh Gupta (aider)
cc4d316764 fix: Improve logging and validation for cookie setting in web scraping 2024-08-26 17:40:50 +05:30
Harsh Gupta
6309cbd7a0 update the start script 2024-08-26 17:36:54 +05:30
Harsh Gupta
66183d8216 set cookies properly 2024-08-15 22:48:52 +05:30
Harsh Gupta (aider)
49a90ee7d4 feat: use urlToCrawl in cookie url 2024-08-15 22:43:50 +05:30
Harsh Gupta
dcc3e5294d fix: Update crawler.ts to log cookies 2024-08-15 22:43:49 +05:30
Harsh Gupta (aider)
1a5f2eb408 feat: Read cookies from x-set-cookie header and set those cookies in crawlOpts, the url needs to be read from the request parameters 2024-08-15 22:40:49 +05:30
Harsh Gupta
fc0023f381 manually set cookie 2024-08-15 22:33:58 +05:30
Harsh Gupta
953429218a Revert "WIP: Cookie fixes"
This reverts commit 12850d79c7.
2024-08-15 21:31:17 +05:30
Harsh Gupta
12850d79c7 WIP: Cookie fixes 2024-08-15 21:23:04 +05:30
Harsh Gupta
bb3f6b3199
Update README.md 2024-08-15 16:48:09 +05:30
Harsh Gupta
abef29075c remove empty file 2024-08-15 16:38:15 +05:30
Harsh Gupta
f60a2a19fb update installation/usage instruction 2024-08-15 16:36:59 +05:30
Harsh Gupta (aider)
aa0dcea9b0 docs: Add setup and usage instructions to README 2024-08-15 16:36:47 +05:30
Harsh Gupta
a7fbe3cb38 finish responding properly 2024-08-15 15:42:39 +05:30
Harsh Gupta (aider)
7677ec77ce Parse request headers properly 2024-08-15 15:12:54 +05:30
Harsh Gupta
19dc9df9cb more console logs 2024-08-15 15:01:48 +05:30
Harsh Gupta (aider)
f6ee7ca6e5 fix: Improve header handling in CrawlerOptions.from() 2024-08-15 14:59:51 +05:30
Harsh Gupta
c77135490b feat: Add logging for scrapping options context 2024-08-15 14:59:49 +05:30
Harsh Gupta
77be0d08ff more console logs 2024-08-14 20:44:25 +05:30
Harsh Gupta (aider)
56c1d461ec feat: Add console.log statements to crawler.ts 2024-08-14 20:42:51 +05:30
Harsh Gupta
3e2bf6d39d add an express endpoint to run the crawl endpoint 2024-08-14 19:34:53 +05:30
Harsh Gupta (aider)
57b07507d1 feat: Add Express server with crawl endpoint 2024-08-14 19:17:25 +05:30
Harsh Gupta
32263c7e9e feat: Add express server for crawling functionality 2024-08-14 19:17:23 +05:30
Harsh Gupta
1d8b3eae0d increase memory limit 2024-08-14 16:59:51 +05:30
Harsh Gupta (aider)
b682ee5bb5 feat: add hello world endpoint in firebase cloud functions 2024-08-14 16:42:03 +05:30
Harsh Gupta
f927aab144 update deps 2024-08-14 16:40:13 +05:30
Harsh Gupta
54aae972ae fix the puppeteer thingy 2024-08-14 16:39:59 +05:30
Harsh Gupta (aider)
a72373f815 fix: Add try-catch block to handle errors in salvage method 2024-08-14 16:04:57 +05:30
Harsh Gupta
888546e064 fix: Make the salvage method private in the puppeteer service 2024-08-14 16:04:56 +05:30
Harsh Gupta (aider)
ef138360c2 fix: Remove private modifier from salvage method 2024-08-14 16:01:00 +05:30
Harsh Gupta (aider)
f6f3fc5bea fix: Improve error handling and add retry mechanism in PuppeteerControl 2024-08-14 15:49:13 +05:30
Harsh Gupta (aider)
a3a299fb38 fix: Implement retry mechanism and improve error handling for scraping function 2024-08-14 15:46:41 +05:30
Harsh Gupta
ddbf0030b4 fix the logger thingy 2024-08-14 15:35:20 +05:30
Harsh Gupta (aider)
a3f222638e feat: Add shared module dependencies and exports 2024-08-14 15:15:07 +05:30
Harsh Gupta (aider)
02abc2aaaa fix: Register Logger class with dependency injection container 2024-08-14 15:11:14 +05:30