mirror of
https://github.com/intergalacticalvariable/reader.git
synced 2024-11-16 03:32:25 +08:00
chore: rename url2text to reader
This commit is contained in:
parent
8378cb06ee
commit
d7fbc41ba2
19
README.md
19
README.md
|
@ -20,9 +20,23 @@ Use accept-header to control the streaming behavior:
|
|||
curl -H "Accept: text/event-stream" https://r.jina.ai/https://en.m.wikipedia.org/wiki/Main_Page
|
||||
```
|
||||
|
||||
If your downstream LLM/agent system requires immediate content delivery or needs to process data in chunks to interleave the IO and LLM time, use Streaming Mode. This allows for quicker access and efficient handling of data:
|
||||
|
||||
```text
|
||||
|
||||
Reader API: streamContent1 ----> streamContent2 ----> streamContent3 ---> ...
|
||||
| | |
|
||||
v | |
|
||||
Your LLM: LLM(streamContent1) | |
|
||||
v |
|
||||
LLM(streamContent2) |
|
||||
v
|
||||
LLM(streamContent3)
|
||||
```
|
||||
|
||||
### JSON mode
|
||||
|
||||
This is still very early and the result is not really a good JSON but three simple field `url`, `title` and `content`. You can use accept-header to control the output format:
|
||||
This is still very early and the result is not really a "useful" JSON. It contains three fields `url`, `title` and `content` only. Nonetheless, you can use accept-header to control the output format:
|
||||
```bash
|
||||
curl -H "Accept: application/json" https://r.jina.ai/https://en.m.wikipedia.org/wiki/Main_Page
|
||||
```
|
||||
|
@ -47,5 +61,8 @@ You might notice a reference to `thinapps-shared` submodule, an internal package
|
|||
|
||||
That said, this repo is *the* codebase behind `https://r.jina.ai`, so everytime we update here, will deploy the new version to the `https://r.jina.ai`.
|
||||
|
||||
## Having trouble on some websites?
|
||||
Please raise an issue with the URL you are having trouble with. We will look into it and try to fix it.
|
||||
|
||||
## License
|
||||
Apache License 2.0
|
Loading…
Reference in New Issue
Block a user