Implement UnstructuredPaddlePDFLoader and UnstructuredPaddleImageLoader using paddleocr (#344)
* jpg and png ocr
* fix
* write docs to tmp file
* fix
* image loader
* fix
* fix
* add pdf_loader
* fix
* update INSTALL.md

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
docs/test.pdf
0 → 100644
File added
img/test.jpg
0 → 100644
7.9 KB
test_image.py
0 → 100644
test_pdf.py
0 → 100644
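
The commit message mentions running OCR on jpg/png input and writing the recognized text to a tmp file before loading it. The diff bodies are not shown here, so the following is only a minimal sketch of that flow, not the code from this commit. The helper names `extract_text_lines` and `ocr_to_txt`, the `tmp_files` directory name, and the assumed OCR result shape (one list per page, each entry `[box, (text, confidence)]`) are all assumptions made for illustration. The OCR engine is passed in as a callable so the sketch runs without paddleocr installed.

```python
import os
from typing import Callable, List


def extract_text_lines(result) -> List[str]:
    """Collect recognized strings from an OCR result.

    Assumed shape (hypothetical here): a list with one item per page or image,
    where each item is a list of [bounding_box, (text, confidence)] entries,
    or None when nothing was detected.
    """
    lines: List[str] = []
    for page in result:
        if not page:  # skip pages with no detections
            continue
        for entry in page:
            lines.append(entry[1][0])  # entry[1] is (text, confidence)
    return lines


def ocr_to_txt(filepath: str, run_ocr: Callable, dir_name: str = "tmp_files") -> str:
    """Run OCR on `filepath` and write the text to a .txt file next to it.

    `run_ocr` is any callable that takes a file path and returns a result in
    the shape described above. Returns the path of the written text file.
    """
    out_dir = os.path.join(os.path.dirname(os.path.abspath(filepath)), dir_name)
    os.makedirs(out_dir, exist_ok=True)
    txt_path = os.path.join(out_dir, os.path.basename(filepath) + ".txt")
    with open(txt_path, "w", encoding="utf-8") as fout:
        fout.write("\n".join(extract_text_lines(run_ocr(filepath))))
    return txt_path
```

With paddleocr installed, `run_ocr` could plausibly be the `ocr` method of a `PaddleOCR(lang="ch")` instance, and the returned text file could then be handed to a plain-text loader. Whether the loaders in this commit are wired exactly this way is not confirmed by the file list above.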