0.1.0	Jun 3, 2024

#527 in Images

Apache-2.0

52KB
1K SLoC

layoutparser-ort

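A simplified version of LayoutParser for detecting layout elements in documents. It runs Detectron2 and YOLOX layout models (in ONNX format, from unstructured-inference) through onnxruntime, using the ort bindings. Check out the examples to get started quickly!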
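As a rough illustration of what working with detected layout elements can look like, here is a minimal, standard-library-only sketch. The `LayoutElement` struct, its fields, and the `filter_and_sort` and `sample_detections` helpers are hypothetical names made up for this example; they are not the layoutparser-ort API, and the sample detections are invented values rather than real model output. It only shows a common post-processing step: dropping low-confidence detections and ordering the rest top to bottom.

```rust
/// Hypothetical detection record for illustration only;
/// this is NOT a type from layoutparser-ort.
#[derive(Debug, Clone, PartialEq)]
struct LayoutElement {
    /// Bounding box as (x1, y1, x2, y2) in pixels.
    bbox: (f32, f32, f32, f32),
    /// Predicted class label, e.g. "Title" or "Table".
    label: String,
    /// Detection confidence in [0, 1].
    score: f32,
}

/// Keep detections with `score >= min_score`, then sort them
/// top-to-bottom (by y1) and left-to-right (by x1) on ties.
fn filter_and_sort(mut elements: Vec<LayoutElement>, min_score: f32) -> Vec<LayoutElement> {
    elements.retain(|e| e.score >= min_score);
    elements.sort_by(|a, b| {
        a.bbox
            .1
            .total_cmp(&b.bbox.1)
            .then(a.bbox.0.total_cmp(&b.bbox.0))
    });
    elements
}

/// Made-up detections standing in for real model output.
fn sample_detections() -> Vec<LayoutElement> {
    vec![
        LayoutElement { bbox: (50.0, 200.0, 500.0, 400.0), label: "Table".to_string(), score: 0.90 },
        LayoutElement { bbox: (50.0, 20.0, 500.0, 60.0), label: "Title".to_string(), score: 0.95 },
        LayoutElement { bbox: (50.0, 100.0, 500.0, 180.0), label: "Text".to_string(), score: 0.30 },
    ]
}

fn main() {
    // Drop anything below 0.5 confidence and print the rest in page order.
    let kept = filter_and_sort(sample_detections(), 0.5);
    for e in &kept {
        println!("{:<6} {:.2} {:?}", e.label, e.score, e.bbox);
    }
}
```

For the crate's actual types and prediction calls, refer to its bundled examples.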

License

layoutparser-ort mirrors its API from LayoutParser and includes pre-processing code from unstructured-inference (both licensed under the Apache License 2.0). Likewise, layoutparser-ort is licensed under the Apache License 2.0.

surya: OCR, layout analysis, reading order, and line detection in 90+ languages
- SegFormer (transformers: SegFormer), Donut (transformers: Donut), CRAFT (pytorch)
- License: GPLv3.0 (code), cc-by-nc-sa-4.0 (models)
  - cc-by-nc-sa-4.0: non-commercial, but the author "waive[s] that for any organization under $5M USD in gross revenue in the most recent 12-month period."
unstructured-inference: hosted model inference code for layout parsing models
- Models: Detectron2 (LayoutParser-PubLayNet-PyTorch, LayoutParser-PubLayNet-ONNX), YOLOX (probably trained on DocLayNet, quantized, ONNX), Table-Transformer (transformers: Table Transformer), Donut (transformers: Donut)
- License: Apache 2.0
LayoutParser: a unified toolkit for deep learning based document image analysis
- Models: Detectron2
- License: Apache 2.0
- Documentation: https://layout-parser.readthedocs.io/en/latest/api_doc/elements.html

~5–20MB
~243K SLoC