The power of interning: making a time series database 2000x smaller in Rust

chaospatterns@lemmy.world · edit-2 1 day ago

but completely backwards in thinking that an undocumented bluetooth backdoor is worse than the worst vulnerability found since the invention of the internet

Right HeartBleed was way worse than this, not on the same level. I wasn’t claiming the opposite.

I was responding to the comment that appeared to suggest they were on the same level.

chaospatterns@lemmy.world · edit-2 1 day ago

No way they’re on the same level. Heartbleed allowed for remote memory reads. This requires you to have access to change the firmware and just gives you some more APIs to control the WiFi system and possibly bypass firmware verification.

chaospatterns@lemmy.world · edit-2 1 day ago

It all depends on how it’s represented on disk though and how the query is executed. Sqlite only supports numbers and strings, and if you keep using a VARCHAR, a read of those rows are going to have materialize a string into memory inside the sqlite library. DuckDB has more types, but if you’re using varchars everywhere, something has to read that string into memory unless you can push down logic into a query that doesn’t actually have to read the actual value, such as one that can use indices.

The best way is to change the representation on disk, such as converting low-cardinality columns like the station into a numeric id. A standard int being four bytes is a lot more efficient than an n-byte string + a header and it can be compared by value.

This is where file formats, like Parquet, shine. They’re oriented more towards parsing by systems. JSON is geared towards human parsing.

chaospatterns@lemmy.world · edit-2 2 days ago

Back in early 2024, I got a survey asking me why I chose to cancel my prime membership and I gave them multiple reasons.

chaospatterns@lemmy.world · 4 days ago

The power of interning: making a time series database 2000x smaller in Rust

chaospatterns@lemmy.world · 4 days ago

Is abstraction killing civilization?

chaospatterns@lemmy.world · 7 days ago

The companion post, I Went To SQL Injection Court, goes into detail about the court process and witness testimony. One of the interesting things is just how different computer people think about security vs lawyers. Somebody might say that having a schema would help a malicious actor a small amount, and a lawyer will jump on that to deny the request. The idea that the schema would help a malicious actor is the same as a map helping a bank robber. The vault security and security guards are the relevant factors for this, not the map.

I’ll keep this in mind the next time I’m an expert witness in a computer case (based on this, I hope I’m not.)

chaospatterns@lemmy.world · 7 days ago

Losing a 5-year-long Illinois FOIA lawsuit for database schemas

chaospatterns@lemmy.world · 13 days ago

In this context, SKU refers to a variant of this product

chaospatterns@lemmy.world · 15 days ago

I use Jellyfin for music mostly and it struggles with metadata. For example, if a song has two artists on it and I edit to correct it, it won’t update correctly and I’ll edit up with the artist “Artist A; Artist B”.

chaospatterns@lemmy.world · 21 days ago

How do modern compilers choose which variables to put in registers?

chaospatterns@lemmy.world · 22 days ago

Have you tried a packet capture with Wireshark or tcpdump to see what it’s doing? It might give better clues than a general error message.

chaospatterns@lemmy.world · 28 days ago

On Windows the system wakes up when connected or disconnected from an AC adapter. On Linux the system will momentarily wake up but immediately go back into suspend.

I get why this could be a source of bugs, but if I unplug my laptop while its asleep why would I want it to turn on?

chaospatterns@lemmy.world · 1 month ago

I’m working on adding ActivityPub to my Hugo blog right now. I support RSS, but I figured AP support means that you can get it into your Mastodon feed or even Lemmy feed making it easy to follow. Additionally, commenting (assuming it doesn’t get taken over by spammers.)

chaospatterns@lemmy.world · 1 month ago

Which stops malicious usage, but doesn’t stop cases where web pages over use pushState as users move around instead of replaceState. I’ve seen maps that would add to the history every time a user moves around the map.

chaospatterns@lemmy.world · 1 month ago

Flash isn’t dead yet.

I just had to use it to connect to an ancient Siemens building automation system. Luckily we’re replacing it this year.

chaospatterns@lemmy.world · 1 month ago

Falsehoods programmers believe about null pointers

chaospatterns@lemmy.world · 1 month ago

Amazon Clarifies Climate Pledge was Just Pledge to Change Climate, Not Improve It

chaospatterns@lemmy.world · 1 month ago

I’m on Wayland and KDE/Plasma. It worked on GNOME, but sadly not on Plasma.

chaospatterns@lemmy.world · 2 months ago

Judge in Seattle blocks Trump order on birthright citizenship nationwide

chaospatterns@lemmy.world · edit-2 2 months ago

Not all filtering is the same. Client side filtering requires more data to passed over the network that then just gets dropped. It also means rules that are not shared across devices.

Most importantly, these use CSS filters which are computationally more expensive because it has to take an entire DOM element, serialize it to text, string search it vs a server side filter that can just look at a one or two field variables. Even if it’s not filtered in SQL on Lemmy’s side I’d say it’s still more efficient overall.

You do what you want, but adding extra work on the client side is not what I’d want for my users. Of course, if your Lemmy instance does not supporting filtering, then this is moot.

chaospatterns@lemmy.world · 2 months ago

And it’ll be faster and more efficient to do it server side as opposed to making uBlock Origin handle it.

chaospatterns@lemmy.world · edit-2 2 months ago

How many users are using browsers that are old enough they don’t even support JS? It’s one thing to disable it for security/privacy (which the OP was talking about), because those users are probably more tech savy.

chaospatterns@lemmy.world · 2 months ago

Do these old browsers not support DuckDuckGo?

chaospatterns@lemmy.world · edit-2 2 months ago

I tried self hosting Pixelfed but gave up because it wouldn’t work. I’m used to Docker containers that are able to just start up by themselves, but the guide didn’t work for me. Maybe it’s time to try again.

chaospatterns@lemmy.world · 2 months ago

One place it would be useful is if you are worried about somebody breaking into your home and stealing your computer. Don’t store the key on the home computer, instead store it on a cloud server. The home computer connects to the cloud server, authenticates itself with some secret, then if the cloud server authorizes, it can return the decryption key.

Then if your computer gets stolen or seized, it’ll connect via a different IP and the cloud server can deny access or even wipe the encryption key.

this doesn’t protect against all risks, but it has its uses.

Example: https://www.ogselfhosting.com/index.php/2023/12/25/tang-clevis-for-a-luks-encrypted-debian-server