dify/api/core/rag/extractor
2024-10-25 22:32:41 +08:00
blob chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
entity chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
firecrawl chore: refurish python code by applying Pylint linter rules (#8322) 2024-09-13 22:42:08 +08:00
unstructured nltk security issue and upgrade unstructured (#9558) 2024-10-23 16:23:55 +08:00
csv_extractor.py chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
excel_extractor.py chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
extract_processor.py Added description for .ppt, specify the reason for unstructured.io (#9452) 2024-10-24 22:13:06 +08:00
extractor_base.py chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
helpers.py chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
html_extractor.py chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
jina_reader_extractor.py feat(website-crawl): add jina reader as additional alternative for website crawling (#8761) 2024-09-30 09:57:19 +08:00
markdown_extractor.py chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
notion_extractor.py chore: refurish python code by applying Pylint linter rules (#8322) 2024-09-13 22:42:08 +08:00
pdf_extractor.py chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
text_extractor.py chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
word_extractor.py fix: wrong element object (#9868) 2024-10-25 22:32:41 +08:00