Hacker Newsnew | past | comments | ask | show | jobs | submit | comcuoglu's commentslogin

Thank you. It seems largely ignored that LLMs still sample from a set of tokens based on estimated probability and the given temperature - but not on factuality or the described "confidence estimate" in the article. RAG etc. only move the estimated probabilities into a more factually based direction, but do not change the sampling itself


Chart.js (with the geo plugin for the choropleth chart) and three.js for the bubble-chart.


Thanks, is fixed now.


Both fixed now, thanks.


Also SQL Server and MSSQL


Also perhaps MariaDB and MySQL should be joined together as well.


This kind of input is exactly what I've hoped for submitting it here, thank you. I agree!


While this made me laugh and there is some truth to it, the nice thing when running the process described in the blog post is that you don't need to know what or how you want to count - the LLM has the knowledge to classify it correctly enough to get good estimations. Go and Rust are both good examples of words that have multiple meanings and are pre-/suffix to many other words.


In total numbers I got 539 jobs saying that they want Rust experience and 695 want Go experience. I think I should have added another line-chart showing the programming language distribution over time, thanks for the idea.


Thanks for looking this up. It's especially interesting bc if I search "golang" on LinkedIn jobs, I see 5,185 results (in the US), but I only get 148 results for "rust".

Hardly scientific, but shows the risk of using Hacker News to draw overly strong conclusions of language popularity.


Could you maybe link me one of those? I've googled a bit but didn't find ready-to-use DBs with that data.



Thank you, looks promising.


It's handy but I think for your use case, the regular API works fine. For instance, you could have just pulled all the whoishiring posts

https://hacker-news.firebaseio.com/v0/user/whoishiring.json?...

without the googling hoops. Not that this is very helpful after you're done!


https://news.ycombinator.com/item?id=40782787

Also the clickhouse dataset, which is free.

Google BigQuery can become very expensive.


I agree, I've realized too late that I should have introduced a "Hybrid" category in this.


Another thing to improve this, is to ask posters to add GLOBAL_REMOTE, COUNTRY_REMOTE or something that indicates is not local remote only (within the same country).


I would add one more category.

Beyond the in-office and the N-days-a-week-hybrid, you have within _actual_ remote roles:

- Country remote (mostly for taxation/regulatory)

- Time zone remote (remote first companies but constraint to within 2 or 3 hours of HQ time zone)

- Anywhere remote (actual remote but often as a contractor or EoR)


Yes, later this week I will follow up with something to tell a little bit about the animation and the sphere positioning, that graph was kind of the most fun in writing this blog post. Thank you for your feedback!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: