Hacker Newsnew | past | comments | ask | show | jobs | submit | unrahul's commentslogin

I have seen this flow in what people in some startups call "Agentic OCR", its essentially a control flow that is coded that tries pdf-parse first or a similar non expensive approach, and if it fails a threshold then use screenshot to text extraction.


Hey Danny, Really nice read.

Do you plan to share the source code to see if we could replicate this?


We are currently focused on our plans for the next phase of this but cleaning things up and open sourcing is something we could consider in the future!


We don’t want hex , can ask in a language that is not popular or the first 5 in the dataset , and it would answer , but not always will work with deep think . Using a tiny translator model in front of the api can make it more ‘open’.


Daniel is of the best engineers I have ever worked with. Engineer in the true sense of wanting to know how something works and figuring out ways to improve it !


Oh thanks!! Super high praise! :)


AI Developers, Startup Founders, Students: Ready for a Challenge? Join the "Advent of GenAI Hackathon" by Prediction Guard, with Intel Liftoff's support. A week-long journey into Generative AI awaits, packed with daily challenges to test your skills. Dive Deep into LLMs and experiment hands-on with Intel Corporation's AI Developer Cloud. Experience the power of Intel Xeon CPUs and Intel Data Center GPU Max! Build a Jupyter Notebook-based application that could win you cloud credits and recognition. Enroll by Dec 2: https://adventofgenai.com


I couldn't find online how to finetune LLMs on an Intel dGPU, so i made a simple version. This particular one can be used to generate text based on your favorite book (for eg). I hope you find it useful if you are having an Intel discrete GPU.


If you are one of those folks who use Linux and intel dGPUs (a tiny minority, I am sure :)) like me and is finding it difficult to set up a functional dev environment for the GPUs (Arc Alchemist, Datacenter Flex, GPU Max cards). This repo will help you set it up. I would love to get your feedback on this and on improvements that can be made. I made this for myself after I was tired of doing this many times. If there are any changes to the intel gpu docs, the repo gets auto-updated, so you can be sure this setup will work (to an extent).

I have also written a verification tool(https://github.com/rahulunair/xpu_verify) that can test if the setup is correct and help you fix it if it is not. The verification scripts will automatically run some C++ sycl parallel programming examples, AI examples using TensorFlow and PyTorch, and a few others. I would once again greatly appreciate any feedback on this.


Congrats!

Really nice work, I just bought it and put in on my reading list for today


I was playing around with PyO3 and thought of building a UUID wrapper for Python using Rust's UUID library.


for the core functionality, the tool pretty much does that, although I am thinking of adding trending repos and things as such.. and any other features that other folks have in mind, keeping the core utility of the tool as opening a repo in your editor


Also, i am looking at ways to make pulling the content the fastest way possible using cdn cached files where possible and things like that..


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: