More

Runonthespot · 2025-09-07T21:59:03 1757282343

Nice catch- should be fixed in latest

Runonthespot · 2025-09-07T21:43:37 1757281417

Added Ruby, but Elixir not very well supported by tree sitter

Runonthespot · 2025-09-07T17:25:40 1757265940

interesting - can I ask you to try a ck --index . ?

postalcoder · 2025-09-07T17:51:17 1757267477

It'd be nice if respected gitignore. It's turning my M4 MBP into a space heater too.

Runonthespot · 2025-09-07T18:00:08 1757268008

coming up next.

mijoharas · 2025-09-07T19:16:41 1757272601

Fyi, I just grabbed the same lib that ripgrep uses. That bit is extracted iirc, and was quite nice and simple to use.

postalcoder · 2025-09-08T05:14:41 1757308481

I saw that you added it, thanks! I'll give this a shot for a few days.

Runonthespot · 2025-09-07T17:19:02 1757265542

BAAI/bge-small-en-v1.5 but considering switching this to google's latest gemmaembedding - it's fairly switchable.

Runonthespot · 2025-09-07T16:24:54 1757262294

we all know rust CLI tools are better right?

dang · 2025-09-07T20:36:25 1757277385

Please don't post misleading titles. This is in the site guidelines: https://news.ycombinator.com/newsguidelines.html.

Runonthespot · 2025-09-07T16:24:24 1757262264

I'll add rust, ruby, elixir, Clojure next. It says rust as it's written in rust, sorry about that!

Runonthespot · 2025-09-07T15:07:38 1757257658

Mainly I wrote it because I noticed Claude's "by design" use of grep meant it couldn't search the code base for things it didn't already know the name of, or find "the auth section". But equally, it's well documented that e.g. Cursor's old RAG technique wasn't that great.

My idea was to make a tool that just does a quick and simple embedding on each file, and uses that to provide a semantic alternative that is much closer to grep in nature, but allows an AI tool like Claude Code to run it from the command line - with some parameters.

Arguably could be MCP, but in my experience setting up a server for a basic tool like this is a whole lot of hassle.

I'm fairly confident that this is a useful tool for CC as it started using it while I was coding it, and even when buggy, was more than willing to work around the issues for the benefit of having semantic search!

furyofantares · 2025-09-07T16:24:08 1757262248

CC is so good with grep that I'm half expecting to clutter its context with bad results from semantic search. But also half optimistic at this just improving its search.

If you're getting useful results from hybrid mode that's very interesting to me since well-constructed grep that claude executes don't really look like they'd work great for semantic search to me! But intuition is often wrong on this stuff.

I am very curious your thoughts on speed. I'd rather any tools claude invokes be as fast as possible so it can get feedback immediately and execute again.

postalcoder · 2025-09-07T18:11:43 1757268703

if you’re concerned about context you can trivially make a hook that will prune your conversation history of older semantic search results.

i do a lot of context management with hooks for all sorts of tool calls.

furyofantares · 2025-09-07T18:36:34 1757270194

That sounds great - do you have any examples?

postalcoder · 2025-09-08T03:40:00 1757302800

For example I have a Stop hook that scans my messages to see which files we've worked on. It'll check to see if the changes to those files have been committed and, if not, it will prevent Claude from stopping and send it a message to commit the specific files in a specific style that includes the id of the current session. The same script also cleans up all previous instances of the same message in the conversation, saving like 5k tokens per session.

I have a lot of PreToolUse hooks that injects guideline messages whenever certain tools are called or bash commands run. My hooks also prune older versions of those out of context. All of the transcripts are in ~/.claude/projects/ in jsonl format and are hot-editable.

mikebiglan · 2025-09-07T17:54:20 1757267660

Starred the repo.

Went to the github repo and was expecting a section about Claude Code and best practices on how to set this up with Claude Code. Very curious to hear how that might work, especially with what you've found compared to Claude Code's love of grep.

jtbaker · 2025-09-07T19:18:32 1757272712

> Went to the github repo and was expecting a section about Claude Code and best practices on how to set this up with Claude Code. Very curious to hear how that might work, especially with what you've found compared to Claude Code's love of grep.

A write up on this would be great!

Runonthespot · 2025-09-07T14:35:52 1757255752

Fair comment- the initial thinking was to have both and in fact a hybrid mode too which fuses results so you can get chunks that match both semantically and on keyword search in one resultset. Later could add a reranker too.

alvis · 2025-09-07T15:14:42 1757258082

Or another way of thinking. How much is the penalty we are talking about for semantic vs conventional grep?

My thinking is that for large codebase, sorting embedding matches maybe more efficient than reading all files and hence there is no point to put semantic search behind a --semantic flag

Runonthespot · 2025-09-07T14:26:33 1757255193

Yes- files are hashed and checked whenever you search so index should always remain up to date. Only changed files are reindexed. You can also inspect the metadata (chunking semantics, embeddings). It’s all in the .ck sidecar

Runonthespot · 2025-09-07T14:19:41 1757254781

It supports most languages but needs a bit of tree-sitter setup to do semantic chunking. Let me know what languages you’d like added

t0mas88 · 2025-09-07T16:23:21 1757262201

Java would be useful as well for larger backend codebases.

Alifatisk · 2025-09-07T15:10:17 1757257817

Thanks for your quick response, most large codebases I've been fiddling on is Ruby!

Runonthespot · 2025-09-09T21:10:36 1757452236

Ruby support has been added!

Alifatisk · 2025-09-10T10:02:44 1757498564

Amazing how quick you were, thank you!

jcgl · 2025-09-09T05:58:39 1757397519

Go would be my top ask. Shell and make would be nice bonuses.

benzible · 2025-09-07T14:36:26 1757255786

I'd love to see elixir support.

Runonthespot · 2025-09-09T21:10:55 1757452255

Sadly, not great support for Elixir from tree-sitter but it should handle them generically as text files

benzible · 2025-09-15T02:23:48 1757903028

Are you familiar with https://github.com/elixir-lang/tree-sitter-elixir ?

Bigsy · 2025-09-07T14:48:05 1757256485

Clojure would be awesome