When “idle” isn't idle: how a Linux kernel optimization became a QUIC bug

blog.cloudflare.com

93 points by sbulaev 11 hours ago


neuralkoi - 5 hours ago

I can see why they rewrote QUIC in Rust and for use in userspace, though going the in-house approach would warrant keeping an eye on the relevant kernel commits like a hawk to avoid missing bug fixes like these. These in-house implementations tend to have less eyeballs than the kernel.

I found it interesting that Cloudflare is not yet using BBR as the default in quiche. CUBIC's recovery in this day and age, and especially in datacenters with large pipes, seems so slooow to me. Almost two seconds with no loss whatsoever till achieving BDP again and then shooting itself in the foot every time it hits the ceiling. Each one of those losses a retransmission.

echoangle - an hour ago

Looking at the last plot, it seems like the backoff is roughly 1/5 of the total bandwith and it happens every 50 ms or so. Wouldn't it make sense to reduce the backoff and the growth speed if a backoff occurs repeatedly in rapid succession? We want to maximize the area under the curve (transmitted packages), right?

blahgeek - 8 hours ago

The more precise title should be: How we copied code from Linux kernel without fully understand it and missed its follow-up fixes, now it bites us

extropy - 4 hours ago

[flagged]