使用paddleocr实现 (#342)
* jpg and png ocr * fix * write docs to tmp file * fix * [BUGFIX] local_doc_qa.py line 172: logging have no end args. (#323) * image loader * fix * fix * update api.py * update api.py * update api.py * update README.md * update api.py * add pdf_loader * fix --------- Co-authored-by: RainGather <3255329+RainGather@users.noreply.github.com> Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
正在显示
docs/test.pdf
0 → 100644
File added
img/qr_code_16.jpg
deleted
100644 → 0
269.3 KB
img/qr_code_17.jpg
0 → 100644
276.4 KB
img/test.jpg
0 → 100644
7.9 KB
loader/__init__.py
0 → 100644
loader/image_loader.py
0 → 100644
loader/pdf_loader.py
0 → 100644
pymupdf | |||
paddlepaddle==2.4.2 | |||
paddleocr | |||
langchain==0.0.146 | langchain==0.0.146 | ||
transformers==4.27.1 | transformers==4.27.1 | ||
unstructured[local-inference] | unstructured[local-inference] | ||
... | ... |
test_image.py
0 → 100644
test_pdf.py
0 → 100644