Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I agree that it's not possible to have them do 13th century architectural style perfectly right now. But I believe it will be soon. The image/video models are improving, but so are the reasoning models, and they can check for and fix anachronisms.


I hope you're right. Are you aware of any image-gen models that apply chain-of-thought style reasoning (either agentic or via reinforcment learning to shape outputs?)

For example, consider this imagery from today's challenge: https://firebasestorage.googleapis.com/v0/b/fastab-f08e9.app...

These are some incredible monoliths: if they were real, I feel like I would have heard about them? And if they did... that's so cool. But because it's AI generated, I have a very low confidence level that this ever existed at all. Which is sad.


[Spoiler] I guess it's this: https://madainproject.com/northern_stelae_park

Which is funny, because the monoliths in the AI video look more eroded than the real ones today.

This looked like a nice idea at first glance. At second glance, it's really bad because you have to assume that everything you see in these videos can be wrong or misleading.


No, not aware of image models that do chain-of-thought reasoning. But there are vision models that do it, so you can have them review the generated images and iterate on the prompts.


Reasoning models aren't needed for this. The loss function for the image models needs to take year into account.

This is entirely possible, as the incredible accuracy[1] of non-generative picture location models (a very similar problem) shows.

[1] https://paperswithcode.com/sota/image-based-localization-on-...


Why not using img2vid starting from an historically accurate picture or painting?


This does use img2vid but with AI generated images. Using real pictures or paintings could definitely be fun too.


You might look into era specific LoRas if they exist, and if not consider training a few to help better capture architectural detail from that specific time frame.


good idea! It would be fun to have a ton of LoRas for different places x eras




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: