More

tkyjonathan · on April 29, 2020

Isn't this DataOps?

schnitsel · on May 7, 2020

I was thinking the same, it could be that the OP isn't familiar with the term yet.

tkyjonathan · on April 15, 2020

Almost none of the company leaders or even VCs fully understand what AI even is or does. They just like to hear thats its there.

If you don't have some AI in your company, you won't get investors.

kavalg · on April 15, 2020

Is it really that bad? I thought investors care more about business model and growth potential.

developerdylan · on April 15, 2020

It isn't THAT bad. Investors definitely care more about model and growth potential. But you will certainly run into those who ask "How are you integrating AI into your platform?" without any real idea about what that means.

tkyjonathan · on April 8, 2020

"Normalization was built for a world with very different assumptions. In the data centers of the 1980s, storage was at a premium and compute was relatively cheap."

But forget to do normalisation and you will be paying 5 figures a month on your AWS RDS server.

"Storage is cheap as can be, while compute is at a premium."

This person fundamentally does not understand databases. Compute has almost nothing to do with the data layer - or at least, if your DB is maxing on CPU, then something is wrong like a missing index. And for storage, its not like you are just keeping old movies on your old hard disk - you are actively accessing that data.

It would be more correct to say: Disk storage is cheap, but SDRAM cache is x1000 more expensive.

The main issue with databases is IO and the more data you have to read, process and keep in cache, the slower your database becomes. Relational or non-relation still follows these rules of physics.

ranman · on April 8, 2020

> This person fundamentally does not understand databases.

Oh boy I do love hackernews :).

It sounds like you’ve spent a lot of your career in a SQL world. Have you worked a lot with DDB/MongoDB/Cassandra? If not then give it a whirl with more than a toy application and share your thoughts. Already done that? Try the brand new “constructive criticism” framework.

Instead of “this person fundamentally does not understand databases” based on 13 words in a 1200+ word article, consider: “I disagree with this statement and here’s why”.

You get all of the karma with none of the ad hominem! Win win!

tkyjonathan · on April 1, 2020

Location: UK, Horsham Remote: Yes Willing to relocate: No Technologies: Data Engineering, Data Architecture, Data Performance Engineering, Java, Python, SQL. Résumé/CV: https://www.linkedin.com/in/jonathanlevin/ Email: mail@jonathanlevin.co.uk

tkyjonathan · on March 23, 2020

Location: West Sussex, UK

Remote: Yes

Willing to relocate: No

Technologies: Data Architect, Data Modeling, Data Performance Engineer, SQL, MySQL, Percona, MariaDB, Galera, Postgres, Kafka, ElasticSearch, RDS, Redshift, Infobright, Java, Python, R.

Resume/CV: https://www.linkedin.com/in/jonathanlevin/

Email: mail@jonathanlevin.co.uk

tkyjonathan · on March 6, 2020

This is obvious to me. Since Hadoop came out, (a lot of) people have been giving up on even forming algorithms and just dumping data into machine learning and hoping for the best. I recall someone high up at Google complaining about it.

We need to get back to forming algorithms as well as concepts and first principles. We cannot and should not expect ML to brute force finding patterns and just sit back and relax.

Here is another prediction for you: we will not solve ray-tracing in games and movie CGI with more hardware. We will need some algorithm that gets us 80-90% of the way there in a smart way.

taneq · on March 6, 2020

This was my first thought. Well, to be more complete - smart algorithms beat dumb algorithms even if the dumb algorithms use hardware acceleration (unless the problem is trivial anyway.) Smart algorithms plus hardware acceleration beats smart algorithms on general purpose hardware. Smart algorithms are just better.

tkyjonathan · on March 5, 2020

I said this 3 years ago, the future is having an SQLite DB inside a container for each one of your customers.

/s

tkyjonathan · on Feb 28, 2020

If its minimum of 70k, then its probably a good idea, because you would only hire if you absolutely have to.

37signals would be proud.

If you decentivize the most productive people, then you might hurt the rest of the company.

tkyjonathan · on Feb 25, 2020

I reduce my RDS bill by 66% by keeping it on EC2 and using Percona Server.

tkyjonathan · on Feb 21, 2020

Batching is the multi-threadedness of databases.

Its also important to remember that in databases, you are more often optimising for IO usage than CPU.