OCR Extractor

by Johnathan Ritzi
favorite
share
0.0
(0)
5
4
3
2
1
Score: 40/100
Description

The OCR Extractor plugin focuses on turning embedded documents and images into searchable text using optical character recognition. It processes attachments already present in notes and converts the extracted content into clean Markdown, placing it directly below the original file inside a collapsible callout. This approach keeps the raw files untouched while still making their contents visible, searchable, and indexable by both internal search and system-level tools. The plugin supports batch extraction, either for a single note or across the entire vault, with progress shown in the status bar and the option to cancel midway. Text extraction is powered by Mistral OCR, which handles complex layouts better than basic OCR engines.

Reviews
No reviews yet.
Stats
9
stars
463
downloads
1
forks
38
days
2
days
5
days
4
total PRs
0
open PRs
2
closed PRs
2
merged PRs
2
total issues
1
open issues
1
closed issues
0
commits
RequirementsExperimental
Latest Version
6 days ago
Changelog
  • Add support for local OCR with Tesseract
  • Improved architecture for adding other services in the future
  • Better handling of Mistral errors
  • Improve status handling and messaging
  • Improve OCR service documentation

Full Changelog: https://github.com/jritzi/ocr-extractor/compare/1.1.0…1.2.0

README file from
Similar Plugins
info
• Similar plugins are suggested based on the common tags between the plugins.
Omnisearch
4 years ago by Simon Cambier
A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.
Text Extractor
3 years ago by Simon Cambier
A (companion) plugin to facilitate the extraction of text from images (OCR) and PDFs.
Taskbone
5 years ago by Dominik Schlund
Obsidian OCR plugin - extract text from images
MathLive
3 years ago by Dan Zilberman
The must-have plugin for math in Obsidian
Obsidian OCR
3 years ago by Jonas Mohr
Obsidian OCR allows you to search for text in your images and pdfs
Image OCR
3 years ago by kaffarell
Runs ocr on pasted images and posts result in details box. This allows to search in images.
Image2LaTEX
2 years ago by Hugo Persson
This is a plugin for obsidian that will read your latest copied image from clipboard and generate math latex from it
AI Image OCR
5 months ago by Rootiest
Obsidian plugin for AI-powered text extraction from images
Image to text OCR
2 years ago by Dario Baumberger
Convert a image in your note to text.
Vision Recall
a year ago by Travis Van Nimwegen
Transform screenshots into searchable Obsidian notes using AI vision and text analysis
Images to Notes
8 months ago by Rodolfo Terriquez
Turn photos of your handwritten notes into markdown
Handwriting OCR
6 months ago by ikmolbo
Transform handwritten documents and scanned images into editable text with Handwriting OCR's AI-powered handwriting to text conversion.
Student Repo
10 months ago by Feirong.zfr
学生知识库助手(Student Repository Helper)是一个面向学生或学生家长的Obsidian 插件,这款插件旨在解决学生在学习阶段面临的资料管理难题,将学习过程中产生的各类重要资料,如试卷、笔记、关键文档、绘画手工作品等,进行系统性的数字化整合与管理,并利用 AI 助手定期进行学习分析总结。随着时间的推移,它将助力你逐步搭建起一座专属你自己的知识宝库,这座宝库将伴随你一生,成为你知识成长与积累的见证。