Hacker Newsnew | past | comments | ask | show | jobs | submit | deckar01's commentslogin

Wiggling the mouse is what people do involuntarily when the computer isn’t working right. They are setting themselves up for Gemini to be the uninvited Clippy, except this will send everything you are working on to Google to harvest data from.

The video they show (which is probably exaggerated by cutting out LLM generation time) is pretty sci-fi. I don't know how it works in practice, but it looks fun to try out. If this could run locally, I'd love to have a feature like that.

Most people don't really seem to care about data collection when it comes to AI usage. A lot of people who will feed Gemini/ChatGPT/Bing/Claude/shady clusters across the internet for bargain bin prices/Mistral every detail of their lives will probably be fine with Gemini as long as it doesn't interfere unnecessarily.


It probably works similar to how Gemini works in Android for a while now.

You can point or select anywhere on the screen and it understands and searches the context. If you select a text block, even text inside an image, it allows to copy or search the text online. Otherwise it can search the image.

I use it often. It's intuitive and fast even on non-flagship phones.

I'd wager their A/B tests went well enough to warrant a port from phones to their new "Chromebook".


Their video is completely different from what Gemini does now. It analyses mouse movements, like circling around things, underlining things with the mouse, pointing at things to indicate where they need to go. It's a lot like the interfaces you might see in sci-fi movies, where generic gestures are understood within context in a way that modern computers can't handle.

> circling around things, underlining things with the mouse

Do we use the same Android Gemini assistant?

Because the one I use does that and it has object detection smart enough to be intuitive. It usually gets it right when I point something on the screen. And when it doesn't, I can circle around the thing or just click again.

This Instagram post for example, it automatically highlighted the entire person, but I wanted to know about the shoes. I then clicked once on the shoes and it knew exactly what I wanted and gave me the info in about 2 seconds:

https://imgur.com/a/lHUeciy

This is useful to non tech savvy folks. Not just to us hackers.


Google's Gemini features differ per region to a massive extent. There's a good chance privacy laws prevent Google from providing me with the same Gemini you use.

Object detection is mediocre at best. Circling things and using their AI editing features works, but the artefacts confuse Lens and other image parsing systems. Extracting objects from images usually mostly works, but it's not on par with what Apple had long before Google built it.

The difference remains that the Gemini app on Android requires activation. You cannot tap a button or click a link while you're on the Gemini screen.

The video isn't on the linked page anymore, but it's here: https://deepmind.google/blog/ai-pointer/ and here: https://www.youtube.com/watch?v=pZNzfQLgGsA

It's an absolute privacy nightmare for most people, but if we ever get enough RAM and compute to run this stuff locally, I think this can actually make a new paradigm for user interaction, something with lisp machine self-customisability but for people who don't know anything about computers.

And if it doesn't work, it'll be the most horrific, messy, useless UI humanity has ever invented, and we all get a new funny meme to laugh about Google. Win-win!


> Most people don't really seem to care about data collection when it comes to AI usage.

That assumes you intended to use AI. People are going to accidentally upload random private content to google.


If you buy the Google Gemini AI Agentic Laptop or whatever they will market this as, you're going to want to try AI. What else is the point of buying a Chromebook, as nice and slick as it may look, when similar or even better alternatives exist.

It's the unofficial "where's my mouse pointer" macro

At least one DE I've used (MacOS? KDE?) even had it as an official macro that would make the pointer 10x bigger when you shook it


MacOS and KDE both do this. In KDE the pointer keeps getting larger the longer you shake it until it is truly absurdly huge.

This might be the new Compiz, my friends love seeing this effect on KDE.

KDE does that by default. Handy sometimes, funny sometimes.

If you keep on shaking the mouse, the pointer just keeps getting bigger and bigger. You can certainly find it when it is enormous.

It is deliberately designed for maximum accidental invocations so the managers and execs behind it can claim the large user numbers in their promo packets.

I have a suspicion the room reverb of anything other than that guy’s lab is enough to break it.

> there was no sign of Polymarket, nor the entity it does business as

The law firm at that address was not their registered agent. Their ToS mandated arbitration with an entity that doesn’t exist.


That Vagabond Builds video... They edited it, but left in a claim it had a frunk. The commentary felt like engagement optimized stream of consciousness blather. They cut out shots of the car being moved and never showed it being driven.

I am surprised US prototyping companies don’t have more regional facilities. It costs ~10x more to have an Oklahoma machine shop cut a sheet metal pattern than to get it shipped from California. The difference seems to just be letting the customer make mistakes rather than requiring manual design verification. I probably would not have purchased a CNC machine if they offered 2 day shipping for <10% of the part cost.

I’m ready for a modern form of representation that isn’t constrained by how many people an old building can hold. I wish small groups could have a representative with a proportionally small fraction of voting power.

Then, why have groups at all? Direct democracy!

Mainly participation. Voter participation is already very low. It would be interesting if voting was more like jury duty and a random sample of the population was selected to vote per issue. That way there are no termed representatives to corrupt and participation is always significant and uniform.

Representation by random selection instead of popular vote was already proposed as a fix for many of the democracy's disadvantages.

I don’t remember it being covered in any of my government classes, and I had not considered alternatives to electoral representation until it started being so openly corrupted. Sortition USA looks interesting.

https://sortitionusa.org/


[flagged]


State ballot measures allow passing laws directly by citizen vote. Peaceful change is possible.

Not at the federal level then?

One side could grow the house fairly easily. There are actual proposals from current members.

I think about troubleshooting like OBST with test cost. Systems are a linear chain of points of failure. The more knowledge you have about how hard components are to test and which components break more often, the easier it is to choose the tests that optimize your time.

They let you write python programs as long as it’s from memory though. I wonder what the code golf looks like for a rudimentary python CAS. If you could evaluate the equation without needing to parse it, I bet you could get a lot of mileage out of a black box gradient decent routine. The analog circuit solver I wrote for my nSpire (without CAS) was ~11kB. https://github.com/deckar01/pylacc

> Why now

I am skeptical that they decided human input is their bottleneck just as the cost per token spiked from some AI providers. I see this as a way to reduce their compute spend (offloaded to the community), but I doubt they are going to give up any creative control, so their employee review bottleneck probably won’t change.


There is another solution: Get a machine with flow control and a pressure gauge on the group head. You can saturate the puck at low pressure to avoid dry pockets, then ramp the flow rate up until the group head pressure peaks. If the pressure starts to drop you can increase the flow to maintain the group head pressure.

As for the 6 bar course grind theory: You may maximize the extraction of soluble coffee mass, but the concentration will be lower. It does not take very much extra water to ruin the taste and texture of a latte.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: