Jasaxion一只大雄

风打,碎琉璃; 打不碎的,是那阳光漫地。The wind strikes, shattering the glazed glass; Yet unbroken remains the sunlight spilling across the earth.

LLM指定格式输出与 LLM OCR 技术实用技术

2025-02-27


LLM 文本 PDF 提取技术

GitHub 资源

  1. MinerU&PDF-Extract-Kit:https://github.com/opendatalab/MinerU?tab=readme-ov-file
  2. olmOCR:https://github.com/allenai/olmocr
  3. LLM-Aided OCR Project:https://github.com/Dicklesworthstone/llm_aided_ocr

LLM 特定格式输出 (Json)

GitHub 资源

  1. outlines:https://github.com/dottxt-ai/outlines/tree/main
  2. guidance:https://github.com/guidance-ai/guidance
  3. super-json-mode:https://github.com/varunshenoy/super-json-mode
  4. Awesome LLM JSON List:https://github.com/imaurer/awesome-llm-json
  5. Enhancing JSON Output with Large Language Models: A Comprehensive Guide:https://medium.com/@dinber19/enhancing-json-output-with-large-language-models-a-comprehensive-guide-f1935aa724fb