> The JSON spec fits on two screen pages https://www.json.org/json-en.html It ab...

MrJohz · 2025-11-10T11:46:02 1762775162

I think you are misreading the phrase "based on". The author, I believe, intends it to mean something like "descends from", "has its origins in", or "is similar to" and not that the ECMAScript 262 spec needs to be understood as a prerequisite for implementing a JSON parser. Indeed, IIRC the JSON spec defined there differs in a handful of respects from how JavaScript would parse the same object, although these might since have been cleaned up elsewhere.

JSON as a standalone language requires only the information written on that page.

geocar · 2025-11-10T18:51:29 1762800689

> JSON as a standalone language requires only the information written on that page.

    JSON.parse("{\"a\":9999999999999999.0}")

Either no browsers implement JSON as written on that page, or you need to read ECMAScript-262 to understand what is going on.

MrJohz · 2025-11-11T11:00:08 1762858808

Well yes, if you're writing a JSON parser in a language based on ECMAScript-262, then you will need to understand ECMAScript-262 as well as the specification for the language you're working with. The same would also apply if you were writing an XML parser in a language based on ECMAScript-262.

If you write a JSON parser in Python, say, then you will need to understand how Python works instead.

In other words, I think you are confusing "json, the specified format" and "the JSON.parse function as specified by ECMAScript-262". These are two different things.

geocar · 2025-11-11T12:18:57 1762863537

> The same would also apply if you were writing an XML parser in a language based on ECMAScript-262.

Thankfully XML specifies what a number is and anything that gets this wrong is not implementing XML. Very simple. No wonder I have less problems with people who implement XML.

> In other words, I think you are confusing "json, the specified format" and "the JSON.parse function as specified by ECMAScript-262". These are two different things.

I'm glad you noticed that after it was pointed out to you.

The implications of JSON.parse() not being an implementation of JSON are serious though: If none of the browser vendors can get two pages right, what hope does anyone else have?

I do prefer to think of them as the same thing, and JSON as more complicated than two pages, because this is a real thing I have to contend with: the number of developers who do not seem to understand JSON is much much more complicated than they think.

MrJohz · 2025-11-11T14:20:09 1762870809

XML does not specify what a number is, I think you might be misinformed there. Some XML-related standards define representations for numbers on top what the basic XML spec defines, but that's true of JSON as well (e.g. JSON Schema).

If we go with the XML Schema definition of a number (say an integer), then even then we are at the mercy of different implementations. An integer according to the specification can be of arbitrary size, and implementations need to decide themselves which integers they support and how. The specification is a bit stricter than JSON's here and at least specifies a minimum precision that must be supported, and that implementations should clearly document the maximum precisions that they support, but this puts us back in the same place we were before, where to understand how to parse XML, I need to understand both the XML spec (and any additional specs I'm using to validate my XML), plus the specific implementation in the parser.

(And again, to clarify, this is the XML Schema specification we're talking about here — if I were to just use an XML-compliant parser with no extensions to handle XSD structures, then the interpretation of a particular block of text into "number" would be entirely implementation-specific.)

I completely agree with you that there are plenty of complicated edge cases when parsing both JSON and XML. That's a statement so true, it's hardly worth discussion! But those edge cases typically crop up — for both formats — in the places where the specification hits the road and gets implemented. And there, implementations can vary plenty. You need to understand the library you're using, the language, and the specification if you want to get things right. And that is true whether you're using JSON, XML, or something else entirely.

chrismorgan · 2025-11-10T12:01:30 1762776090

> my experience implementing API is that every XML implementation is obviously-correct

This is not my experience. Just this week I encountered one that doesn’t decode entity/character references in attribute values <https://news.ycombinator.com/item?id=45826247>, which seems a pretty fundamental error to me.

As for doctypes and especially entities defined in doctypes, they’re not at all reliable across implementations. Exclude doctypes and processing instructions altogether and I’d be more willing to go along with what you said, but “obviously-correct” is still too far.

Past what is strictly the XML parsing layer to the interpretation of documents, things get worse in a way that they can’t with JSON due to its more limited model: when people use event-driven parsing, or even occasionally when they traverse trees, they very frequently fail to understand reasonable documents, due to things like assuming a single text node, ignoring the possibilities of CDATA or comments.

drob518 · 2025-11-10T13:58:49 1762783129

Exactly. In my experience, XML has thousands of ways to trip yourself while JSON is pretty simple. I always choose JSON APIs over XML if given the choice.

geocar · 2025-11-10T18:56:35 1762800995

> This is not my experience.

Try not to confuse APIs that you are implementing for work to make money, with random "show HN AI slop" somebody made because they are looking for a job.

VMG · 2025-11-10T11:01:09 1762772469

The "References" section of the XML spec is almost longer than the JSON spec itself

> [...] serious XML implementation [...]

You are cherry-picking here

marcosdumay · 2025-11-10T14:23:27 1762784607

> advice to "always" treat numbers as strings

FFS, have your parser fail on inputs it can not handle.

Anyway, the book defining XML doesn't tell you how your parser will handle values you can't represent on your platform either. And it also won't tell you how our parser will read timestamps. Both are completely out of scope there.

The only common issue in JSON that entire book covers is comments.

The SOAP specification does tell you how to write timestamps. It's not a single book, and doesn't cover things like platform limitations, or arrays. If you want to compare, OpenAPI's spec fills a booklet:

https://swagger.io/docs/specification/v3_0/about/

vbezhenar · 2025-11-10T15:56:53 1762790213

> FFS, have your parser fail on inputs it can not handle.

I wish browser developers would understand that.

    JSON.parse("9007199254740993") === 9007199254740992