Hacker News | JTrehan's comments

I'm still working on my PDF search engine for desktop: https://www.docgoblin.com/ I'm implementing a bookmark utility right now and hope to add support for multiple e-book formats in the near future.

A one-time-payment app, interesting (I'm also working on something with a similar monetization model). How are things going? I'd love to hear about the experience of another solopreneur. What stack are you using? I also wonder:
- What are you using to parse PDFs and extract the text? I found that to be a nightmare when I was doing something similar for WithAudio (my app).
- Are you just extracting the text, or are you doing any post-processing to identify which lines belong to the same paragraph?

Things are going slow, but it is a passion project so it's ok :) A few people have bought a licence and it seems most people who try the app are very happy with it so I'm happy too.

The app is written entirely in Java, with JavaFX for the UI and Lucene for the search engine. To read and render PDFs I use PDFium.
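For readers wondering what "index, then search" means for a stack like this, here is a minimal sketch of the underlying idea: an inverted index that maps each term to the pages it appears on. It is not DocGoblin's code and it deliberately does not use Lucene, so that it runs with the JDK alone. The class name `TinyPageIndex`, its methods, and the `file#page` identifier format are all hypothetical, and the page text is assumed to have been extracted already (for example by a PDF library).

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

/** Toy inverted index (hypothetical, for illustration only; a real app would use Lucene). */
final class TinyPageIndex {
    // term -> ordered set of "file#page" identifiers where the term occurs
    private final Map<String, Set<String>> postings = new TreeMap<>();

    /** Index the already-extracted text of one page. */
    void addPage(String file, int page, String text) {
        String id = file + "#" + page;
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, t -> new LinkedHashSet<>()).add(id);
        }
    }

    /** Return the pages that contain every term of the query (AND semantics). */
    List<String> search(String query) {
        Set<String> result = null;
        for (String term : tokenize(query)) {
            Set<String> hits = postings.getOrDefault(term, Set.of());
            if (result == null) {
                result = new LinkedHashSet<>(hits); // start from the first term's pages
            } else {
                result.retainAll(hits);             // keep only pages that also match this term
            }
        }
        return result == null ? List.of() : new ArrayList<>(result);
    }

    /** Lowercase the text and split it on anything that is not a letter or digit. */
    private static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        TinyPageIndex index = new TinyPageIndex();
        index.addPage("rules.pdf", 12, "Roll initiative at the start of combat.");
        index.addPage("bestiary.pdf", 3, "Goblins roll initiative with advantage.");
        System.out.println(index.search("combat initiative"));
    }
}
```

A real search library adds much more on top of this (relevance ranking, phrase queries, an on-disk index format), which is presumably why the app relies on Lucene rather than a hand-rolled structure.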


I'm working on a self-hostable ebook library (https://github.com/colibri-hq/colibri), and currently tinkering with searching over book content. Have you written about your approach to search somewhere, perhaps? I would be very interested in learning how others go about this. Kudos for DocGoblin, looks great :-)

I am still working on DocGoblin (https://docgoblin.com), a PDF search engine based on Lucene, PDFium and JavaFX. The app is super fast and users are happy with it. I'm in the process of adding support for plain text files and making the website look nicer.


Oh, thanks for this, this is going to make my life easier!


I am not the author.


I'm currently working on a PDF search engine that will index and then search through several thousand PDFs. As a tabletop role-player and former researcher, I've always wanted a simple tool to do this, and I hope I won't be the only one to benefit from it. This is my first project of this scale and I hope to be able to show it here soon.

On the technical side, it's a desktop application coded entirely in Java with a JavaFX interface. No AI or online data uploads, just the Lucene search engine and PDFium for PDF rendering.

Here is a very short demo: https://youtu.be/CGo9JRUByGA

