More

west0n · on Dec 12, 2024

If you want to run a database on VPS with ssd disk, you'll need at least two replicas for data redundancy, which would cost about $10.

delusional · on Dec 12, 2024

The article scopes this as "for development, testing, demos, or short-lived workloads". Do you really need HA/replication for any of those workloads?

west0n · on Dec 12, 2024

but when the VPS fails, you lost all your data

delusional · on Dec 12, 2024

You probably need to define "fails" for me. I've never had a VPS straight up "fail" before, as in a hardware level, can't access the data, dead. You may temporarily lose access to the data, but I've always been able to recover said data.

d0100 · on Dec 12, 2024

VPS backups are $1 for the $5 1GB VPS

west0n · on Aug 6, 2024

I'm curious about how rqlite's performance compares to other distributed databases developed in Go, such as CockroachDB, Vitess, and TiDB.

jitl · on Aug 6, 2024

It’s going to have much lower write throughput, since SQLite is single-writer and on top of that you need to do Raft consensus. TiDB and CockroachDB can handle concurrent writes easily. Cockroach runs raft per “range” of 128mb of the key space, I’m not as familiar with TiDB. Vitess is an orchestration layer over MySQL, and MySQL handles concurrent writes easily.

otoolep · on Aug 6, 2024

rqlite creator here.

That's correct, there is a write-performance hit for the reasons you say. All Raft systems will take the same hit, and SQLite is intrinsically single-writer -- nothing about rqlite changes that[2]. That said, there are approaches to increasing write-performance substantially. See [1] for much more information.

Write-performance is not the only thing to consider though (assuming one has sufficient performance in that dimension). Ease of deployment and operation are also important, and that's an area in which rqlite excels[3] (at least I think so, but I'm biased).

[1] https://rqlite.io/docs/guides/performance/

[2] https://rqlite.io/docs/faq/#rqlite-is-distributed-does-that-...

[3] https://rqlite.io/docs/faq/#why-would-i-use-this-versus-some...

otoolep · on Aug 6, 2024

Oh, I also presented some performance numbers in a presentation to a CMU a couple of years back. A little out-of-date, but gives a order-of-magnitude sense. https://youtu.be/JLlIAWjvHxM?t=2690

The biggest performance improvement since is due to the introduction of Queued Writes. See https://rqlite.io/docs/api/queued-writes/

joostdecock · on Aug 6, 2024

> Ease of deployment and operation are also important, and that's an area in which rqlite excels

Amen. I've been building something appliance-like where I want to support clustering but I don't want to manage a database cluster inside the project.

Rqlite is so easy to run either stand-alone or clustered. It's a godsend.

And when people want postgres or whatever, I let them bring their own database. It's not hard to abstract a database storage layet if you plan ahead.

But if you want it to 'just work' rqlite is doing that with flying colors.

spmurrayzzz · on Aug 6, 2024

Relevant to the original inquiry — I really admire that you bring up the etcd and consul comparison right up front in the readme. For my own comprehension at least, it makes obvious the type of workloads for which you're optimizing and I appreciate that context as a past user of both of those stacks.

ClumsyPilot · on Aug 6, 2024

Maybe ETCD is a more appropriate comparison?

protosam · on Aug 6, 2024

Depending on what you’re using these tools for. If you want a locking manager and some meta data storage to help your distributed system maintain state, etcd is better for the job than rqlite for that. It’s a better zookeeper. With etcd you can hold a lock and defer unlocking if the connection is disrupted. Rqlite is not a good option for this.

otoolep · on Aug 6, 2024

Agreed, in the sense that while rqlite has a lot in common with etcd (and Consul too -- Consul and rqlite share the same Raft implementation[1]) rqlite's primary use case is not about making it easy to build other distributed systems on top of it.

[1] https://github.com/hashicorp/raft

protosam · on Aug 8, 2024

Every time I've looked at rqlite, it just falls short features-wise in what I would want to do with it. A single raft group does not scale horizontally, so to me rqlite is a toy rather than a tool worth using (because someone might mistake the toy as production grade software).

otoolep · on Aug 8, 2024

rqlite creator here.

That's clearly a mistaken attitude because both Consul and etcd also use a single "Raft group" and they are production-grade software.

Ruling out a piece of software simply because it doesn't "scale horizontally" (and only writes don't scale horizontally in practice) is a naive attitude.

protosam · on Aug 8, 2024

The qualifier here is for /my/ use cases. However I couldn't recommend rqlite over better options at the level of scale that it can fill.

One of the problems is if you're working with developers, the log replication contents is the queries, instead of the sqlite WAL like in dqlite. I know this is a work around to integrate mattn/sqlite3, but it's untenable in enterprise applications where developers are going to just think "oh, I can do sqlite stuff!". This is a footgun that someone will inevitably trigger at some point if rqlite is in their infrastructure for anything substantial. In enterprise, it's plainly untenable.

Another issue is if I want to architect a system around rqlite, it wont be "consistent" with rqlite alone. The client must operate the transaction and get feedback from the system, which you can not do with an HTTP API the way you've implemented it. There was a post today where you can observe that with the jetcd library against etcd. Furthermore to this point, you can't even design a consistent system around rqlite alone because you can't use it as a locking service. If I want locks, I end up deploying etcd, consul, or zookeeper anyways.

If I had to choose a distributed database with schema support right now for a small scale operation, it would probably be yugabyte or cockroachdb. They're simply better at doing what rqlite is trying to do.

At the end of the day, the type of people needing to do data replication also need to distribute their data. They need a more robust design and better safety guarantees than rqlite can offer today. This is literally the reason one of my own projects has been in the prototyping stage for nearly 10 years now. If building a reliable database was as easy as integrating sqlite with a raft library, I would have shipped nearly 10 years ago. Unfortunately, I'm still testing non-conventional implementations to guarantee safety before I go sharing something that people are going to put their valuable data into.

To simply say I'm "ruling out a piece of software because it doesn't scale horizontally" is incorrect. The software lacks designs and features required for the audience you probably want to use it.

Hopefully you find my thoughts helpful in understanding where I'm coming from with the context I've shared.

otoolep · on Aug 8, 2024

Wow, a lot there. Thanks for your comments.

>One of the problems is if you're working with developers, the log replication contents is the queries, instead of the sqlite WAL like in dqlite.

I think you mean rqlite does "statement-based replication"? Yes, that is correct, it has its drawbacks, and is clearly called out in the docs[1].

>Another issue is if I want to architect a system around rqlite, it wont be "consistent" with rqlite alone. The client must operate the transaction and get feedback from the system, which you can not do with an HTTP API the way you've implemented it.

I don't understand this statement. rqlite docs are quite clear about the types of transactions it supports. It doesn't support traditional transactions because of the nature of the HTTP API (though that could be addressed).

>Furthermore to this point, you can't even design a consistent system around rqlite alone because you can't use it as a locking service. If I want locks, I end up deploying etcd, consul, or zookeeper anyways.

rqlite is not about allowing developers build consistent systems on top of it. That's not its use case. It's highly-available, fault-tolerant store, the aims for ease-of-use and ease-of-operation -- and aims to do what it does do very well.

>If I had to choose a distributed database with schema support right now for a small scale operation, it would probably be yugabyte or cockroachdb. They're simply better at doing what rqlite is trying to do.

https://rqlite.io/docs/faq/#why-would-i-use-this-versus-some...

Of course, you should always pick the database that meets your needs.

>If building a reliable database was as easy as integrating sqlite with a raft library, I would have shipped nearly 10 years ago.

Who said it was easy? It's taken almost 10 years of programming to get to the level of maturity it's at today.

>They need a more robust design and better safety guarantees than rqlite can offer today.

That is an assertion without any evidence. What are the safety issues with rqlite within the context of its design goals and scope? I would very much like to know so I can address them. Quality is very important to me.

[1] https://rqlite.io/docs/api/non-deterministic/

protosam · on Aug 8, 2024

> That is an assertion without any evidence.

This seems like a lack of knowledge issue. The problems with rqlite are inherit in it's design as I've already articulated. You can literally start reading jepsen analyses right now and understand it if you don't already: https://jepsen.io/analyses

otoolep · on Aug 8, 2024

Can you be more specific?

"Evidence Dump Fallacy." This fallacy occurs when a person claims that a certain proposition is true but, instead of providing clear and specific evidence to support the claim, directs the questioner to a large amount of information, asserting that the evidence is contained within.

protosam · on Aug 8, 2024

You realize that your product offers no transaction support due to the HTTP API right?

otoolep · on Aug 8, 2024

Transactions -- or the lack thereof -- have nothing to do with the consistency guarantees offered by rqlite.

You may wish to read this:

https://github.com/wildarch/jepsen.rqlite/blob/main/doc/blog...

rqlite -- to the best of my knowledge and as a result of extensive testing -- offers strict linearizability due to its use of the Raft protocol. Each write request to rqlite is atomic because it's encapsulated in a single Raft log entry -- this is distinct from the other form of transactions offered by rqlite[1], but that second form of transaction functionality has zero effect on the guarantees offered by Raft and rqlite (they are completely different things, operating at different levels in the design). If you know otherwise I'd very much like to know precisely why and how.

[1] https://rqlite.io/docs/api/api/#transactions

protosam · on Aug 8, 2024

I won't be following up further. I've shared all I have to share on this topic. On a personal level, I'm actually disappointed in how you take to critical feedback about your product and don't seem to be interested in understanding the problem domain you're developing for.

https://gist.github.com/protosam/35880f46ed3f3e80a4e2ec47e6b...

west0n · on April 26, 2024

If we didn't have abstractions like POSIX, applications would need to write an adaptor for every supported file system.

west0n · on April 14, 2024

This analysis is very insightful.

emidoots · on April 14, 2024

no malice intended, but I will take my cloud provider's managed database service.. for which we get support, SOC2 compliance, years of proven stability, no major approval needed by my organization, and budget already approved - rather than jump on the latest 'kubernetes is eating the world' fad.

spxneo · on April 14, 2024

theres so many of these me-too links now ive just largely begun to ignore it

its soft spam

rjbwork · on April 14, 2024

Yup. These kinds of tangentially related ad-posts on HN make me actively hostile to whatever it is they're selling.

west0n · on April 11, 2024

As the founder and CEO of a two-year-old startup, seeking certainty in the direction amidst uncertainty (determining which products can bring customers and profits) should become my instinct.

west0n · on April 10, 2024

Interesting, another project implemented in Go that is compatible with MySQL server, alongside others like Vitess and TiDB.

west0n · on April 9, 2024

See some development frameworks for local-first apps support both SQLite and PostgreSQL. The advantage of using PostgreSQL is that when you add a cloud option to your local-first app, the migration becomes much easier.

west0n · on April 8, 2024

I bet that all message queues and log databases will support S3, as these types of data generally have a large volume and aren't as economically valuable (don't get me wrong, what I mean is that these databases won't be frequently read and processed).

chenyang · on April 8, 2024

Can't agree more! S3 will be the modern data storage primitive. Also, the move towards shared storage and separating compute from storage is a key trend in cloud-native architecture, enhancing scalability and cost-efficiency.

west0n · on April 2, 2024

Here is our approach: From Markdown to a Docusaurus Website via GitHub, gh Actions, and gh pages

https://baky0905.github.io/personal-website/blog/2021/04/02/...

west0n · on April 2, 2024

Looks like microVM.

bastawhiz · on April 2, 2024

That's a giant stretch. Kraft is a surname. I assumed it was an April fools joke that Kraft the cheese company was making their own cloud (which would be hilarious).

fhuici · on April 2, 2024

I wonder whether we could get Kraft to sponsor us :) . Unikraft used to be called Unicraft but was changed to a "k" for name clash reasons. Agree that names are hard.

doubled112 · on April 2, 2024

I jumped to Kraft Dinner and wondered why that was the kind of perception you’d want to cause people to have about your product.

Kraft Dinner and Kraft Singles, my two favourite almost cheese products.

Names are really hard.

jagged-chisel · on April 2, 2024

Sounds like craft. Seems more likely to me an intentional misspelling of “craft” or related to the Kraft Foods company if you’re in the US.

Absolutely nothing about this headline had me thinking about consensus protocols.

pelasaco · on April 2, 2024

LOL. Kraft is german for power/strength.

jagged-chisel · on April 2, 2024

I caught on to that immediately. My statement was meant to mean that it would so so unlikely to indicate the Raft protocol, that it’s more likely to have been a reference to these other things. It is not a reference to those things, but it even less of a reference to Raft.

fhuici · on April 2, 2024

Yes, we're a German company, it's a play on words :)

moooo99 · on April 2, 2024

Kraft is just a the German word for strength or force.

karolist · on April 2, 2024

The parent is downvoted because the original comment contained some stretch logic talking about possible confusion with Raft consensus protocol due to naming similarities, the current comment is after the ninja edit.

adhamsalama · on April 2, 2024

And Kraft (Kafka without Zookeeper).

fhuici · on April 2, 2024

Yes, our kraft CLI tool (https://github.com/unikraft/kraftkit/) definitely has a clash with KRaft .