keeeba · 2025-12-11T21:07:38 1765487258

Doesn’t seem like this will be SOTA in things that really matter, hoping enough people jump to it that Opus has more lenient usage limits for a while

keeeba · 2025-11-25T22:07:23 1764108443

As a fairly extensive user of both Python and R, I net out similarly.

If I want to wrangle, explore, or visualise data I’ll always reach for R.

If I want to build ML/DL models or work with LLMs I will usually reach for Python.

Often in the same document - nowadays this is very easy with Quarto.

Joel_Mckay · 2025-11-25T22:22:30 1764109350

Python has a list of issues fundamentally broken in the language, and relies heavily on integrated library bindings to operate at reasonable speeds/accuracy.

Julia allows embedding both R and Python code, and has some very nice tools for drilling down into datasets:

https://www.queryverse.org/

It is the first language I've seen in decades that reduces entire paradigms into single-character syntax, in many cases outperforming both C and NumPy. =3

pphysch · 2025-11-26T00:28:04 1764116884

Deeply ironic for a Julia proponent to smear a popular language as "fundamentally broken" without evidence.

https://yuri.is/not-julia/

kelipso · 2025-11-26T03:50:12 1764129012

This is like one of those people posting Dijkstra’s letter advocating for 0-based indexing without ever having read or understood what they posted.

pphysch · 2025-11-26T04:49:41 1764132581

What does indexing syntax have to do with Julia having a rough history of correctness bugs and footguns?

Joel_Mckay · 2025-11-26T09:40:34 1764150034

Sure, all software is terrible if looking at bug frequency history...

https://github.com/python/cpython/issues

Griefers ranting about years old _closed_ tickets on v1.0.5 versions on a blog as some sort of proof of lameness... is a poorly structured argument. Julia includes regression testing features built into even its plotting library output, and thus issues usually stay resolved due to pedantic reproducibility. Also, running sanity-checks in any llvm language code is usually wise.

Best of luck =3

pphysch · 2025-11-26T17:02:13 1764176533

Just saying, "other languages have bug reports" is an exceptionally poor way to promote Julia =3

Joel_Mckay · 2025-11-26T17:42:58 1764178978

To be blunt: Moore's law is now effectively dead, and chasing the monolithic philosophy with lazy monads will eventually limit your options.

Languages like Julia trivially handle conditional parallelism much more cleanly with the broadcast operator, and transparent remote host process instancing over ssh (still needs a lot of work to reach OTP like cluster functionality.)

Much like Go, library resources ported into the native language quietly move devs away from the same polyglot issues that hit Python.

Best of luck. =3

Joel_Mckay · 2025-11-26T01:02:09 1764118929

Python threading and computational errata issues go back a long time. It is a popular integration "glue" language, but is built on SWIG wrappers to work around its many unresolved/unsolvable problems.

Not a "smear", but rather a well known limitation of the language. Perhaps your environment context works differently than mine.

It is bizarre people get emotionally invested in something so trivial and mundane. Julia is at v1.12.2 so YMMV, but Queryverse is a lot of fun =3

keeeba · 2025-11-24T19:17:25 1764011845

Oh boy, if the benchmarks are this good and Opus feels like it usually does then this is insane.

I’ve always found Opus significantly better than the benchmarks suggested.

LFG

keeeba · 2025-11-07T20:55:35 1762548935

Please don’t actually use these 5,6,7-way Venn diagrams for anything practical, they’re virtually useless and communicate nothing.

roadside_picnic · 2025-11-07T22:06:59 1762553219

Technically a Venn diagram's entire point is to visualize all possible set relations between N sets. Their "practical" use is explicitly visualizing this.

In popular terminology they are very often confused with Euler diagrams [0], which show only the relations that actually hold between the sets, not every possible one. You shouldn't create Euler diagrams this complex, but the raison d'être of Venn diagrams is to visualize the complex nature of set relations.

0. https://en.wikipedia.org/wiki/Euler_diagram
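To make the distinction concrete, here is a minimal Python sketch (the example sets and function names are made up for illustration). A Venn diagram of N sets has to draw every one of the 2^N - 1 membership combinations, while an Euler diagram only needs the combinations that are actually occupied.

```python
from itertools import product

def venn_regions(n):
    """All 2**n - 1 membership patterns a Venn diagram of n sets must draw
    (the region outside every set is excluded)."""
    return [bits for bits in product((False, True), repeat=n) if any(bits)]

def occupied_regions(sets):
    """Membership patterns that contain at least one element; these are
    the only regions an Euler diagram needs to show."""
    names = list(sets)
    universe = set().union(*sets.values())
    return {tuple(x in sets[name] for name in names) for x in universe}

# Hypothetical example sets, chosen only for illustration.
example = {
    "A": {1, 2, 3},
    "B": {2, 3, 4},
    "C": {5},
}

# Number of regions a Venn diagram needs for 3, 5, 6 and 7 sets.
region_counts = {n: len(venn_regions(n)) for n in (3, 5, 6, 7)}

# Regions that are actually populated in the example.
occupied = occupied_regions(example)
```

For 5, 6 and 7 sets the region count is 31, 63 and 127, which is why those diagrams get hard to read, while the three example sets occupy only 4 of their 7 possible regions.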

somat · 2025-11-08T07:28:36 1762586916

There is always the complicated wires puzzle from "Keep Talking and Nobody Explodes", where a 5-way Venn diagram encodes what action you need to take for a given state.

https://bombmanual.com/web/index.html#ComplicatedWires

However you could make a good argument that having a complicated and confusing diagram is the point of that puzzle.

emmelaich · 2025-11-08T02:56:13 1762570573

Agree, I think the linked UpSet diagram is better.

paulddraper · 2025-11-07T21:09:14 1762549754

Thanks, I was just about to do that!

keeeba · 2025-10-28T09:28:36 1761643716

I agree it is a profound question. My thesis is fairly boring.

For any given clustering task of interest, there is no single value of K.

Clustering & unsupervised machine learning is as much about creating meaning and structure as it is about discovering or revealing it.

Take the case of biological taxonomy: what K will best segment the animal kingdom?

There is no true value of K. If your answer is for a child, maybe it’s 7 corresponding to what we’re taught in school - mammals, birds, reptiles, amphibians, fish, and invertebrates.

If your answer is for a zoologist, obviously this won’t do.

Every clustering task of interest is like this. And I say of interest because clustering things like digits in the classic MNIST dataset is better posed as a classification problem - the categories are defined analytically.
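A rough way to see the "no single K" point in code: the sketch below cuts a toy taxonomy at different depths and counts the resulting groups. The tree is a hypothetical, heavily simplified stand-in and not a real zoological classification; the only point is that each cut is a valid grouping with a different K.

```python
# Hypothetical, heavily simplified taxonomy used only for illustration.
TAXONOMY = {
    "invertebrates": {
        "arthropods": {"insects": {}, "arachnids": {}},
        "molluscs": {},
    },
    "vertebrates": {
        "mammals": {},
        "birds": {},
        "reptiles": {},
        "amphibians": {},
        "fish": {},
    },
}

def clusters_at_depth(tree, depth):
    """Return the group names obtained by cutting the tree at `depth`.
    Branches that end above the cut are kept as they are."""
    if depth == 1:
        return sorted(tree)
    groups = []
    for name, children in tree.items():
        if children:
            groups.extend(clusters_at_depth(children, depth - 1))
        else:
            groups.append(name)
    return sorted(groups)

# Number of clusters (K) produced by each cut of the same data.
k_by_depth = {d: len(clusters_at_depth(TAXONOMY, d)) for d in (1, 2, 3)}
```

Each depth gives a different K for the same animals, and which one is "right" depends on who is asking.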

keeeba · 2025-10-17T04:42:01 1760676121

“Skills are a simple concept with a correspondingly simple format.”

From the Anthropic Engineering blog.

I think Skills will be useful in helping regular AI users and non-technical people fall into better patterns.

Many power users of AI were already doing the things it encourages.

keeeba · 2025-09-25T17:58:11 1758823091

It came from nowhere to 1T tokens per week, seems… suspect.

keeeba · 2025-08-20T17:12:21 1755709941

What use-cases do you see for the 270M’s embeddings, and should we be sticking to token embeddings or can we meaningfully pool for sentence/document embeddings?

Do we need to fine-tune for the embeddings to be meaningful at the sentence/document level?
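For context, "pooling" here usually means something like mean pooling: averaging the token vectors while skipping padding. A minimal pure-Python sketch, with toy vectors standing in for real model output (the function name and the values are assumptions for illustration):

```python
def mean_pool(token_embeddings, attention_mask):
    """Average the embeddings of real tokens, ignoring padded positions."""
    kept = [vec for vec, keep in zip(token_embeddings, attention_mask) if keep]
    if not kept:
        raise ValueError("attention_mask selects no tokens")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]

# Toy 2-dimensional "token embeddings"; the last position is padding.
tokens = [[1.0, 0.0], [3.0, 2.0], [9.0, 9.0]]
mask = [1, 1, 0]

# One vector for the whole sequence, built from the two real tokens.
sentence_vector = mean_pool(tokens, mask)
```

This only shows the mechanics. Whether the pooled vector is meaningful without fine-tuning is exactly the open question above.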

keeeba · 2025-08-05T20:26:46 1754425606

Anthropic say Opus is better, benchmarks & evals say Opus is better, Opus has more parameters and parameters determine how much a NN can learn.

Maybe Opus just is better

8n4vidtmkvmk · 2025-08-06T04:19:48 1754453988

Even if it's better on average, doesn't mean it's better for every possible query

keeeba · 2025-07-28T06:49:13 1753685353

How have you tested your recall in the long and short term? And what were the results?

photios · 2025-07-28T07:08:04 1753686484

Gut feeling, of course :)