Hacker Newsnew | past | comments | ask | show | jobs | submit | thenickdude's commentslogin

Deep in the abyss, the dark lord stirs


I was trying out an IDE plugin for local LLM integration, but it would just make totally insane edits instead of what I asked for.

It turned out that the LLM-runner's default setting for handling queries that were too long was to silently delete the middle of the query. So the LLM saw the beginning of my ask, and the tail end of the code context, and nothing else.


I've run Japanese windows software under Wine, and with no configuration it was a sea of crashy mojibake. With the right locale configured, it worked fine.


At the bottom of each issue is a huge amount of whitespace, then they've stuffed it with a bunch of @'s to notify users.

https://i.imgur.com/LRotRlS.png

The link goes to a typo-squatted domain using an "l" instead of an "i".


It's all in aid of some streetsweeper being able to add "contributor to X, Y, Z projects!" to their GitHub résumé. Before LLMs were a thing I also received worthless spelling-incorrection pull requests with the same aim.


Are spelling correction PRs not welcome? I'd never put it on a résumé but if I'm following a README and I see a typo, I'll generally open a quick PR to fix that. (no automated tools, not scanning for typos, just a human reading a README).


> Are spelling correction PRs not welcome?

I think a true spelling correction would be welcome. But I think the kind BS attitude the GP is describing often leads to useless reformatting/language tweaks, because the goal isn't to make the repo better, it's to make a change for making a change's sake with as little effort as possible.


Kind of like how on the Salesforce support forums, there are a lot of incorrect answers from people who don't appear to have understood the question, but they all have "Please mark my answer as helpful" at the bottom. (And this started long before AI.) If there's an incentive to "contribute," even if it's something as small as being able to put "Salesforce support volunteer" on your LinkedIn page, and it's very easy to do so, you'll get a lot of low-effort (or worse) contributions.


I’ve always wondered how salesforce and Microsoft get such huge numbers of supporters doing that unpaid work for them. They’re often Indian sounding names so I suspect that one or more of the big consultancy or outsourcing companies uses it as a hiring signal.


A real improvement to the documentation or readme is welcome, even if it is only a minor improvement. I have put in small grammar PRs on some documentation myself.

On the flip side, I used to get a lot of spam PRs that made an arbitrary or net neutral change to our readme, presumably just to get "contributor" credit. That is not welcome or helpful to anyone.


> Before LLMs were a thing I also received worthless spelling-incorrection pull requests with the same aim.

I always find it a pity when someone has been clever and it's missed. "Spelling incorrection", get it? It's not a correction. It's the opposite.


Ah, you're right, I totally missed that! But tbh that makes less sense to me. If this was pre-LLM then this is suggesting that humans were opening pull requests that change correctly-spelled words to incorrectly-spelled words? Is that some weird attempt at steganography?


Nope, just a large number of people who don't know English. As an example in context, someone might run a spell check and decide "incorrection" is an error, removing the humor.


Depends on the project.


This means you should still have the email from legal@, right? In that case you can solve the mystery of how they managed to pass DMARC by sharing the headers from it.


This was not Google's decision, it was forced on them by the EU under their Digital Services Act:

https://9to5mac.com/2024/10/17/developers-address-phone-numb...

The same thing was applied to the Apple store at the same time.


This directive only apply if you are trading (aka making money) with the app. Why does Google force-collect all this information when you can opt to not be a trader (no in-app purchases, no ads, etc)?


We have that already, we call it "Stack Overflow"


hands down, thread winner


Interestingly when I asked GPT-4o (at least that's what it said it was):

>According to the instructions, which UnicodeCIDFont fonts should be used when generating PDFs?

It replies:

>When generating PDFs using reportlab for East Asian languages, you must use specific UnicodeCIDFont fonts depending on the language. According to the instructions, use the following:

>Korean: HeiseiMin-W3 or HeiseiKakuGo-W5 or HYSMyeongJo-Medium

>Simplified Chinese: STSong-Light

>Traditional Chinese: MSung-Light

>These fonts must be registered using pdfmetrics.registerFont(UnicodeCIDFont(font_name)) and applied to all text elements in the PDF when outputting those languages.

This list also has the Japanese fonts merged with the Korean list.

https://chatgpt.com/share/6895a4e6-03dc-8002-99d6-e18cb4b3d8...


They've released their report now, 10 further certificates were mis-issued:

https://bugzilla.mozilla.org/show_bug.cgi?id=1961406


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: