<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:media="http://search.yahoo.com/mrss/" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>MinerU on recca0120 Tech Notes</title><link>https://recca0120.github.io/en/tags/mineru/</link><description>Recent content in MinerU on recca0120 Tech Notes</description><generator>Hugo -- gohugo.io</generator><language>en</language><lastBuildDate>Fri, 24 Apr 2026 21:00:00 +0800</lastBuildDate><atom:link href="https://recca0120.github.io/en/tags/mineru/index.xml" rel="self" type="application/rss+xml"/><item><title>MinerU in Practice: Turning PDFs into RAG-Ready Markdown</title><link>https://recca0120.github.io/en/2026/04/24/mineru-pdf-to-markdown/</link><pubDate>Fri, 24 Apr 2026 21:00:00 +0800</pubDate><guid>https://recca0120.github.io/en/2026/04/24/mineru-pdf-to-markdown/</guid><description>Feeding PDFs to LLMs breaks formulas, tables, and multi-column layouts. I ran MinerU 2.5 on an academic PDF — formulas became LaTeX, tables became HTML, reading order preserved, and it runs on CPU.</description><content:encoded>&lt;![CDATA[Feeding PDFs to LLMs breaks formulas, tables, and multi-column layouts. I ran MinerU 2.5 on an academic PDF — formulas became LaTeX, tables became HTML, reading order preserved, and it runs on CPU.<br/><img src="https://recca0120.github.io/2026/04/24/mineru-pdf-to-markdown/featured.png" alt="Featured image"/>]]></content:encoded><enclosure url="https://recca0120.github.io/2026/04/24/mineru-pdf-to-markdown/featured.png" type="image/png" length="0"/><media:content url="https://recca0120.github.io/2026/04/24/mineru-pdf-to-markdown/featured.png" medium="image"/><category>MinerU</category><category>PDF</category><category>RAG</category><category>OCR</category><category>LLM</category><category>AI</category></item></channel></rss>