Obsidian OCR

by Jonas Mohr
5
4
3
2
1
Score: 47/100

Description

Category: Note Enhancements

The Obsidian OCR plugin adds optical character recognition (OCR) capabilities to Obsidian, allowing you to search for text within images and PDFs. By integrating Tesseract, the plugin automatically processes new image and PDF files added to your vault. It features customizable settings such as adjusting OCR processes, image quality, and image density. The plugin supports multiple OCR providers, with Tesseract as the default, and offers flexibility in configuration to cater to different OCR needs.

Reviews

No reviews yet.

Stats

302
stars
22,053
downloads
5
forks
1,282
days
30
days
849
days
46
total PRs
13
open PRs
4
closed PRs
29
merged PRs
29
total issues
12
open issues
17
closed issues
2
commits

Latest Version

2 years ago

Changelog

Welcome back! After almost 1 year on hiatus, I'm back with this release.

  • Fixed the "out of memory error", some users were having trouble with
  • Removed automatic installation feature It never got out of alpha, because it was a pain to manager and not really useful
  • Updated some dependencies
  • Removed unused dependencies
  • Removed some dead code
  • Switched to webpack

README file from

Github

Obsidian
TypeScript NPM Git WebStorm
Windows Linux macOS

⚠️ Still in early development ⚠️

Obsidian OCR

Obsidian OCR allows you to search for text in your images and PDFs

Installation

Dependencies
  • tesseract for OCR
  • imagemagick for pdf to png conversion
Windows
Manual installation

❗Make sure the executables are in your path. If you don't know how look here: https://www.architectryan.com/2018/03/17/add-to-the-path-on-windows-10/

Automatic installation

❗The automatic installation features is still in very early development. Expect bugs❗
ℹ️ Obsidian OCR uses chocolatey for automatic installation ℹ

  • Install & enable the plugin
  • Under settings, press the automatic install button
MacOS
  • Install tesseract: brew install tesseract
  • Install tesseract-lang : brew install tesseract-lang
  • Install imagemagick : brew install imagemagick

For the path: check where the binaries are located and add these to "/private/etc/paths" (I also added them to ~/.zshrc, not sure if that is needed)

  • brew list tesseract in my case: /opt/homebrew/Cellar/tesseract/5.2.0/bin/
  • brew list tesseract-lang in my case: /opt/homebrew/Cellar/tesseract/5.2.0/bin/
  • brew list imagemagick in my case: /opt/homebrew/Cellar/imagemagick/7.1.0-43/bin/
Ubuntu
Manual installation
  • Run sudo apt install -y tesseract-ocr imagemagick
  • Install any languages you need by installing the appropriate package (usually named tesseract-ocr-<lang>)
Automatic installation

❗The automatic installation features is still in very early development. Expect bugs❗

  • Install & enable the plugin
  • Under settings, press the automatic install button
Arch / Manjaro
Manual installation
  • Run sudo pacman -S tesseract imagemagick
  • Install any languages you need by installing the appropriate package (usually named tesseract-data-<lang>)
Automatic installation

❗The automatic installation features is still in very early development. Expect bugs❗

  • Install & enable the plugin
  • Under settings, press the automatic install button
Flatpak

If Obsidian is running via the Flatpak installation (such as provided by default in Pop!_OS) then this plugin will not operate. Flatpak sandboxing will change the filepaths so even providing host access will still be problematic. If you have a Flatpak installation you will need to reinstall via a different method to successfully use this plugin.

Plugin installation
  • Simply install the plugin from the community plugins

Usage

  • Upon enabling the plugin, you will see a progress bar in the bottom right, indicating that your vault is currently being processed. Indexing
  • Upon completion, you can either use the Search OCR command or the magnifying-glass icon in the ribbon to open the search menu. SearchModal
  • When adding a new PDF / PNG to your vault, the file is automatically being searched for text.
  • Upon starting Obsidian, you will now see another progress bar, indicating that all transcripts are being cached.
    Caching

Settings

Obsidian OCR offers a variety of settings you can configure yourself.

Name Description Default
Max OCR Processes The maximum amount of ocr processes running at the same time. Increasing this speeds up indexing but also increases cpu usage 1
Max caching processes The maximum amount of caching processes running at the same time. Increaing this speeds up caching but also increases cpu usage 10
OCR Image Decides whether or not images (.png, .jpg, .jpeg) are OCRed true
OCR PDF Decides whether or not PDFs (.pdf) are OCRed true
Image density The density of generated PNGs, in dpi. Increasing this helps to OCR smaller text, but increases cpu usage 300
Image quality The quality of generated PNGs. Increasing this helps to OCR smaller text, but increaes cpu usage 98
Additional imagemagick args Additional commandline arguments passed to imagemagick when converting a PDF to PNG(s)
Additional search paths Additional paths that will be searched when looking for external dependencies. Useful when installing into custom directories
OCR Provider The OCR provider that will be used. See below for a description of providers NoOp

OCR Providers

List of OCR providers
Name Description
NoOp The NoOp (no operation) provider does, as the name implies, nothing and is only a dummy provider. To get real OCR capability, you have to switch to another provider
Tesseract OCR provider using tesseract to OCR documents locally on your computer
Tesseract

By default, tesseract offers two languages to choose from: eng and osd.

  • Osd isn't a language, but rather stands for Orientation and script detection and is therefore not useful for our use.
  • Eng stands for english and should be chosen if your documents are written in english.

ℹ After switching your language, only newly indexed documents use the new language. You can reindex your already added documents by using the Delete all transcripts command ℹ

Lang vs Script

Tesseract supports langs and scripts for text recognition.

  • Langs support a single language
  • Scripts support multiple languages with similar characters. Latin, for example, supports all latin-based languages (English, German, Italian, France, ...) except for Vietnamese
Custom OCR providers

To add a custom OCR provider, create a new class that extends OCRProvider and register it using OCRProviderManager.registerOCRProviders(new MyCustomProvider())

Similar Plugins

info
• Similar plugins are suggested based on the common tags between the plugins.
Image Inserter
4 years ago by Ray Hao
An Obsidian plugin helps users easily search and insert images to editors from Unsplash.
Text Extractor
3 years ago by Simon Cambier
A (companion) plugin to facilitate the extraction of text from images (OCR) and PDFs.
Image OCR
3 years ago by kaffarell
Runs ocr on pasted images and posts result in details box. This allows to search in images.
MathLive
3 years ago by Dan Zilberman
The must-have plugin for math in Obsidian
Image2LaTEX
3 years ago by Hugo Persson
This is a plugin for obsidian that will read your latest copied image from clipboard and generate math latex from it
Image to text OCR
2 years ago by Dario Baumberger
Convert a image in your note to text.
Image Search
2 years ago by Mohammad Razeghi
Obsidian Image Search Using Brave API
Taskbone
5 years ago by Dominik Schlund
Obsidian OCR plugin - extract text from images
Omnisearch
4 years ago by Simon Cambier
A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.
Vision Recall
a year ago by Travis Van Nimwegen
Transform screenshots into searchable Obsidian notes using AI vision and text analysis
Student Repo
a year ago by Feirong.zfr
学生知识库助手(Student Repository Helper)是一个面向学生或学生家长的Obsidian 插件,这款插件旨在解决学生在学习阶段面临的资料管理难题,将学习过程中产生的各类重要资料,如试卷、笔记、关键文档、绘画手工作品等,进行系统性的数字化整合与管理,并利用 AI 助手定期进行学习分析总结。随着时间的推移,它将助力你逐步搭建起一座专属你自己的知识宝库,这座宝库将伴随你一生,成为你知识成长与积累的见证。
Images to Notes
a year ago by Rodolfo Terriquez
Turn photos of your handwritten notes into markdown
Handwriting OCR
9 months ago by ikmolbo
Transform handwritten documents and scanned images into editable text with Handwriting OCR's AI-powered handwriting to text conversion.
AI Image OCR
8 months ago by Rootiest
Obsidian plugin for AI-powered text extraction from images
OCR Extractor
5 months ago by Johnathan Ritzi
Obsidian plugin to extract text from PDFs, documents, images, etc. and store it as Markdown in notes