Hat schon mal jemand mit #imagemagick ein Bild von einem digitalen Display für #tesseract OCR aufbereitet? Wie sähe eine näherungsweise sauber arbeitende Zeile für #convert hinsichtlich Schärfung, Kontrastverbesserung und Graustufenkonversion aus? Retoot gerne gesehen.

**Eugen Rochko** @Gargron@mastodon.social · Mar 30

Mar 30

Eugen Rochko @Gargron@mastodon.social

Looks like #TheDearHunter is coming back to Europe this year at #BeProgMyFriend in Spain and #Euroblast in Germany, both in September. Alongside #TesseracT!

**Tommi Nieminen** @tomminieminen@mastodontti.fi · Feb 18

Feb 18

Tommi Nieminen @tomminieminen@mastodontti.fi

I’m still annoyed with the state of #OCR in #Linux (or #FLOSS in general). Not that the need for OCR’ing hasn’t diminished by the years, as more and more of publications are already in electronic form, but every once in a while a need arises. #Tesseract’s quality is #abysmal (and not in Joey’s sense). #ABBYYFineReader used to be the best in #Windows, and once upon a time they provided a #CLI-usable OCR engine for Linux too, but not any more. #atkjuttuja #computers

**Openhuman** @Openhuman@mastodon.online · Feb 17 *

Feb 17 *

Openhuman @Openhuman@mastodon.online

#OpenSource Programm I need.

1.some sort of an apple tags like variant for the open source world ( best is file manager from #elementaryos at this point but it only support tagging 8 colours no #
(Nice to look at automation like the Mac #hazel or Mac #defaultfolderx)

2.and #preview replacement ( pdf and other files reader with most of the pro features and some sort of working #ocr ( possibly a gui of #tesseract ? ) for #Linux and #android preferably (best I found was #pdfsambasic)

Replied to Daniela Schneider

**Benjamin Rosemann** @b2m@mastodon.social · Feb 13

Feb 13

Benjamin Rosemann @b2m@mastodon.social

@SchnDa If you are thinking less GUI and more workflow then you might also want to check #ocrd https://ocr-d.de/. Simplified it provides an abstraction over several #ocr tools including #tesseract, #calamari, #kraken, ... to build customized ocr workflows.

ocr-d.de- OCR-D

**Daniela Schneider** @SchnDa@fedihum.org · Feb 7

Feb 7

Daniela Schneider @SchnDa@fedihum.org

Hi #histodons,
I need your expertise. We want to integrate an #opensource #ocr tool into our #useGalaxy Platform so you can better analyse your texts, etc.
I worked with #tesseract some years ago, and I heard about #ocr4all.
Do you have experience with any of these - or other recommendations?
We are also integrating #tranksribus via API but want another ocr-specific option.
Looking forward to your experiences!

@galaxyfreiburg
@NFDI4Memory

Replied in thread

**Thomas** @coastgnu@norden.social · Feb 5

Feb 5

Thomas @coastgnu@norden.social

@nicosemsrott

Mit ImageToolbox¹ lässt sich der Text aus Bildern extrahieren!

Umlage Tool of verwendet Tesseract² für die OCR.

Beide Werkzeuge sind Freie Software unter der GPL Lizenz der FSF³.

¹ https://github.com/T8RIN/ImageToolbox

Für alle die Bilder mit Textpassagen einstellen!

² https://github.com/tesseract-ocr/tesseract

³ https://de.m.wikipedia.org/wiki/GNU_General_Public_License
#ImageToolbox #Tesseract

**Eugen Rochko** @Gargron@mastodon.social · Feb 4

Feb 4

Eugen Rochko @Gargron@mastodon.social

#TesseracT, #Leprous, #GreenLung and #Sungazer among others announced for this year's #ArcTanGent. Tempting

**barefootstache** @barefootstache@qoto.org · Jan 14

Jan 14

barefootstache @barefootstache@qoto.org

#DeepStash tries to be an alternate to algorithmic #SocialMedia by providing small bites of information from books, articles, and/or quotes. It has the ability to bookmark content, though it is limited for free accounts.

Alternately one can just screenshot the specific chunks of data and potentially make one's own filtered timeline if one combines the data from the screenshots into #Anki. This can either be done directly by using the images or one can quickly extract the text via #tesseract and some #python to generate a #csv file which can be used as an import into Anki.

**barefootstache** @barefootstache@qoto.org · Jan 9

Jan 9

barefootstache @barefootstache@qoto.org

#TIL that it looks like #tesseract is preinstalled on #fedora.

This means I do not need to battle with #python to extract quotes from images and instead can do it all via #bash like

```
for f in *.png; do tesseract "$f" "${f%.*}"; done
```

within the specific directory

**Yann Büchau** @nobodyinperson@fosstodon.org · Jan 7

Jan 7

Yann Büchau @nobodyinperson@fosstodon.org

I treated my #homelab some #paperless today.

Paperless NGX is a really amazing piece of software that you can throw documents at and it will #OCR them with #tesseract and allow you to search, tag and organize them.

It works very smoothly and it's currently processing all the 20k documents (incl. old versions) from our family #gitAnnex documents repo - that'll take a while

The #NixOS module is also fantasticly easy to use as always.

paperless-ngx screenshot with search for 'arduino' open. The PDF books 'Python Playground', 'Dead Simple Python' and 'Mastering ArduinoJSON' (and its invoice) are in the results.

#selfhosted #selfhosting

**Tom** @thomas@metalhead.club · Dec 17, 2024 *

Dec 17, 2024 *

Tom @thomas@metalhead.club

My Album Of The Year is: "Lingua Ignota Pt. 1" by Persefone.

I chose the album as my AOTY because I was eagerly awaiting the release and not a single song on the EP disappoints! Every single one picks me up and combines the familiar Persefone sound without being stingy with refreshing new elements. I attended their live concert shortly after the releases and couldn't be happier! Great band, awesome show, nice crowd! Can't wait for Pt. 2!

There were 3 more candidates for my AOTY. In order of preference:

TesseracT - War of Being
DVNE - Voidkind
VOLA - Friend of a Phantom

Cover image of Persefone - Lingua Ignota. Picture of something that looks like long hair without a head.

#aoty #aoty2025 #metal

**cuNha** @mudaste@mstdn.social · Nov 27, 2024

Nov 27, 2024

cuNha @mudaste@mstdn.social

Que programas #OCR #OpenSource recomendais para #Linux?

Já experimentei vários baseados em #tesseract e descobri recentemente o #Rescribe que faz um trabalho razoável.

https://rescribe.xyz/

rescribe.xyzRescribe OCR

**The Krononaut Moon Project** @KronoMoon@me.dm · Oct 24, 2024 *

Oct 24, 2024 *

The Krononaut Moon Project @KronoMoon@me.dm

#4dToys: a #Box of #Four #Dimensional #Toys

We've shared this #video before, but it's one of our favorites on the problem of #visualizing or #conceptualizing higher #dimensional #geometries. The 4th dimension is not always #Time, but is perpendicular to the lower 3. Watch as these objects slip in & out of our #World, like #TimeTravelers!

https://www.youtube.com/watch?v=0t4aKJuKP0Q 02 Jun 2017
https://Wikipedia.org/wiki/Four-dimensional_space

YouTube4D Toys: a box of four-dimensional toys, and how objects bounce and roll in 4DBy [mtbdesignworks {Miegakure, 4D Toys}]

#Community #TimeTravel #Research

**The Krononaut Moon Project** @KronoMoon@me.dm · Sep 29, 2024 *

Sep 29, 2024 *

The Krononaut Moon Project @KronoMoon@me.dm

A Beginner's Guide to the #FourthDimension

Combining simple images with multi-dimensional #animation, this 6-min video illustrates basic concepts of objects in a #4thdimension. If you find this interesting, you can search on other animated videos of 4 #dimensions and higher — some being many times more complex than this one.

https://www.youtube.com/watch?v=j-ixGKZlLVc 01 Jul 2016

YouTubeA Beginner's Guide to the Fourth DimensionBy The Science Elf

#Community #TimeTravel #Research

**Preston Maness ☭** @aspensmonster@tenforward.social · Aug 17, 2024

Aug 17, 2024

Preston Maness ☭ @aspensmonster@tenforward.social

My #tesseract #python project for automating enrichment of acta data on the #venezuela election is... maybe actually done? I might try and tweak some more bits -- namely, see if it's worth going character by character for the GUIDs and sha256 hashes -- but, for the most part, the labeling script I've got whipped up is knocking out every acta I throw at it at this point (except the absolute worst ones). It grabs the datetime, voting center info, geo info, qr code info, members of the mesa (names and citizen IDs), basically everything.

**Preston Maness ☭** @aspensmonster@tenforward.social · Aug 14, 2024

Aug 14, 2024

Preston Maness ☭ @aspensmonster@tenforward.social

FINALLY getting some decent success and accuracy rates with #tesseract after an absurd amount of pre-processing. If you want high accuracy, you reeeeally have to baby it. But, on the other hand, stuff like this was still science fiction just 20 or so years ago, so... maybe I shouldn't be too harsh.

Now, to clean up this monstrosity of a "script" that's now clocking in at 1197 lines.

**Preston Maness ☭** @aspensmonster@tenforward.social · Aug 14, 2024

Aug 14, 2024

Preston Maness ☭ @aspensmonster@tenforward.social

First attached image:

$ tesseract --dpi 72 row-13.jpg stdout
Empty page!!
Empty page!!
$

Second attached image:

$ tesseract --dpi 72 test.jpg stdout
Cl: 7823632
$

God damn #tesseract you're really needy.

Recent searches

Search options

Administered by:

Server stats:

#tesseract