Practical Techniques for LLM Structured-Format Output and LLM OCR
2025-02-27
LLM Text and PDF Extraction Techniques
GitHub Resources
- MinerU & PDF-Extract-Kit: https://github.com/opendatalab/MinerU?tab=readme-ov-file
- olmOCR: https://github.com/allenai/olmocr
- LLM-Aided OCR Project: https://github.com/Dicklesworthstone/llm_aided_ocr
LLM Structured-Format Output (JSON)
GitHub and Other Resources
- outlines: https://github.com/dottxt-ai/outlines/tree/main
- guidance: https://github.com/guidance-ai/guidance
- super-json-mode: https://github.com/varunshenoy/super-json-mode
- Awesome LLM JSON List: https://github.com/imaurer/awesome-llm-json
- Enhancing JSON Output with Large Language Models: A Comprehensive Guide: https://medium.com/@dinber19/enhancing-json-output-with-large-language-models-a-comprehensive-guide-f1935aa724fb
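
As a baseline for comparison with the tools listed above, here is a minimal sketch of the simplest approach to JSON output: let the model answer freely, then pull the first JSON object out of the reply and check it for required keys. It uses only the Python standard library and is not the API of outlines, guidance, or super-json-mode, which take different approaches to shaping the output. The function names `extract_json_object` and `missing_keys`, and the sample reply string, are hypothetical and chosen only for illustration.

```python
import json


def extract_json_object(text: str) -> dict:
    """Return the first JSON object embedded in `text`.

    Scans for each '{' and tries to decode a JSON value starting there,
    so surrounding chatter before or after the object is ignored.
    Raises ValueError if no decodable object is found.
    """
    decoder = json.JSONDecoder()
    start = text.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value beginning at index `start`
            # and tolerates trailing text after it.
            value, _end = decoder.raw_decode(text, start)
            return value
        except json.JSONDecodeError:
            # Not valid JSON from this brace; try the next one.
            start = text.find("{", start + 1)
    raise ValueError("no JSON object found in model output")


def missing_keys(obj: dict, required: list[str]) -> list[str]:
    """Return the required keys that are absent from `obj`, in order."""
    return [key for key in required if key not in obj]


# Hypothetical model reply: the JSON is wrapped in extra prose.
reply = 'Sure, here is the result: {"title": "Invoice", "pages": 3} Hope that helps.'

parsed = extract_json_object(reply)
absent = missing_keys(parsed, ["title", "author"])
print(parsed)
print(absent)
```

If `missing_keys` returns a non-empty list, or `extract_json_object` raises `ValueError`, a common follow-up is to re-prompt the model with the error message.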