TLS Version Intolerance in SSL Pulse

You often hear that TLS is the most important security protocol. Usually, the reasoning is that it’s very widely deployed and also that it works for many higher-level protocols. That’s certainly true, but for those who work more closely with these protocols there is another important aspect: we can learn so much about protocol design by carefully examining the evolution of TLS.

Update (13 January 2017): Since this post was originally published, the TLS 1.3 specification was amended to change how future protocol versions are negotiated. The new approach bypasses the intolerance problem described below. Thus deployment of TLS 1.3 will proceed largely without fear of breaking compatibility.

Protocol Version Intolerance

One important lesson we learned is never to use explicit protocol version numbers. In TLS, they have been a continuous source of trouble, leading to waste of time and security issues. The root cause is the fact that there are many servers that freak out when a client offers to negotiate a TLS version number they don’t understand. According to the protocol specification, they’re supposed to respectfully decline to use an unknown version and fall back to the best protocol they can offer.

Despite this being a server-side problem (and, at a deeper level, a library problem), in practice browsers get all the blame if they can’t communicate with a particular web site. This leads to a behaviour sometimes called voluntary protocol downgrade: after trying their best protocol version and failing, browsers try their second best, then third best, and so on, until hopefully they manage to establish an encrypted channel with the server.

Although this browser behaviour helps with interoperability, it slows down communication and also enables active network attackers to downgrade secure communication to the worst protocol version supported by both the client and server.

After many years of struggle, major browsers vendors eventually managed to disable the fallbacks. The fallback to SSL v3 was the first to go, prompted by the discovery of the POODLE attack. Other fallbacks were removed later, as you can see on this timeline of events in the SSL/TLS ecosystem.

TLS 1.3

The version intolerance problems will persist for as long as we continue to publish new protocol versions. Protocol fallbacks are relevant again because TLS 1.3 is close to being finished and we now have to face the harsh reality of many intolerant servers out there.

Testing for version intolerance in SSL Labs

Protocol version intolerance testing has been implemented in SSL Labs for a very long time. If you’re unfortunate to run into a server that’s intolerant to some of TLS 1.0, 1.1, or 1.2, you get a big warning. But such servers are getting increasingly rare; after all, most browsers implemented TLS 1.2 by the end of 2013, which means that system owners have had a few years to resolve the intolerance problems.

We also test for intolerance of TLS 1.3 and some other absurdly large version numbers. The TLS 1.3 results are now of immediate interest; the other tests possibly more of interest to protocol developers.

Given that the release of TLS 1.3 is imminent, we’re planning to make the intolerance warnings more prominent, as well as take it into account when grading servers. There will soon be a separate post about that.

Version intolerance tracking in SSL Pulse

Our SSL Pulse project, which monitors SSL/TLS configuration on about 150,000 most popular web sites, provides version intolerance information from the first ever scan we did in April 2012. You can see what that looks like in the following chart.

You can see that, in our most recent scan (July 2016), 3.2% servers from our data set don’t like TLS 1.3. A further 3.6% servers don’t like TLS 2.152. This doesn’t sound like much, but it’s a huge problem for browsers because it translates to thousands of sites, some very popular.

You will also notice a huge drop in the number of intolerant servers in May 2015, which warrants an explanation. When we first started measuring version intolerance, the problem was genuinely much bigger; at peak, with about 12% servers intolerant to TLS 1.3 and over 60% servers intolerant to TLS 2. With such numbers, there was little chance of introducing a new protocol version smoothly. But then someone realised that the problem could be reduced with only a slight tweak to the TLS 1.3 protocol.

If you look under the hood of any SSL or TLS protocol version, you will see that they all use two different version numbers. There’s one version used for the main protocol (TLS 1.0, 1.1, and so on), but there is also the record layer version. You can think about the record layer as a subprotocol. With two version numbers, there’s always been a confusion about what exactly to send in each field. Some clients would only use the main protocol version (e.g., TLS 1.2), but keep the record layer version at SSL 3 or TLS 1.0, to indicate that, at that layer, nothing changed. However, some clients would use the bigger protocol versions for both fields. Naturally, in our tests we simulated the worst case.

Anyway, it transpired that most problems arise from record layer intolerance. Given that no one actually needs two version numbers, the TLS 1.3 specification was subsequently changed to deprecate the record layer version number and fix it at TLS 1.0 (0x0301). Then, some time later, in May 2015, we started testing for version number intolerance under the new rules, which caused the big drop.

Browsers are getting into a problem every time, because they are not trying to prevent a problem in the first place, but when problem appears they try to fix it. The side effect to fix a problem is producing much bigger problem (security protocol fallback problem).

In my humble opinion browsers should start preventing a problem by implementing protocol intolerance checking now when TLS 1.3 is not out yet and harm is made yet. Browser should start checking like ones per session per domain if domain is TLS 1.3 tolerant and maybe save this data to browser server infrastructure to prevent network latency (this may be a privacy issue, so this should be implemented with privacy in mind like opt-in or similar). When TLS 1.3 intolerant server is found then at the top of browser display annoying little yellow bar, just e.g. like Firefox is now displaying if pop-up window is blocked [1]. Currently it would just inform users that page can cause a trouble in the future and maybe having some option to not display this bar for this domain again. But this annoyance primary target is not for end-users it is intended for developers/administrators to notice something is going to brake in the future and to start fixing the problem before it actually appears. Intention is to reduce the bad servers to minimum and so there would be no need to implement browser protocol fallback.

I know this may sound a radical suggestion, but we should ask ourself what is the alternative? Currently not doing anything at all, but when TLS 1.3 is released then we will award bad servers with browser protocol fallback mechanism and security jeopardize good servers. And when bad gays strike millions of users, then all of the world gets into stupid face: “Whaaaaat just happened.”?

From ssllabs.com point the best thing you can do is start degrading grade if browser is not tolerant to 1.3 protocol and so help reduce need for protocol fallback. I know ssllabs.com does not have ultimate authority, but it may have some effect, and some other people are going to have another effect and combine effects may have positive effect too.

[1] https://i.imgur.com/rrc3EHh.png

Reply to j-mailor

Comments Cancel reply

Michael Davey says:

August 2, 2016 at 8:18 AM

How can I determine the status of long handshake intolerance, TLS extension intolerance and TLS revision intolerance via the API?

Reply to Michael
1. Bhushan Lokhande says:
  
  August 3, 2016 at 2:43 AM
  
  Not available in API currently. It will available soon https://github.com/ssllabs/ssllabs-scan/issues/370
  
  Reply to Bhushan
j-mailor says:

August 8, 2016 at 11:10 PM

Browsers are getting into a problem every time, because they are not trying to prevent a problem in the first place, but when problem appears they try to fix it. The side effect to fix a problem is producing much bigger problem (security protocol fallback problem).

In my humble opinion browsers should start preventing a problem by implementing protocol intolerance checking now when TLS 1.3 is not out yet and harm is made yet. Browser should start checking like ones per session per domain if domain is TLS 1.3 tolerant and maybe save this data to browser server infrastructure to prevent network latency (this may be a privacy issue, so this should be implemented with privacy in mind like opt-in or similar). When TLS 1.3 intolerant server is found then at the top of browser display annoying little yellow bar, just e.g. like Firefox is now displaying if pop-up window is blocked [1]. Currently it would just inform users that page can cause a trouble in the future and maybe having some option to not display this bar for this domain again. But this annoyance primary target is not for end-users it is intended for developers/administrators to notice something is going to brake in the future and to start fixing the problem before it actually appears. Intention is to reduce the bad servers to minimum and so there would be no need to implement browser protocol fallback.

I know this may sound a radical suggestion, but we should ask ourself what is the alternative? Currently not doing anything at all, but when TLS 1.3 is released then we will award bad servers with browser protocol fallback mechanism and security jeopardize good servers. And when bad gays strike millions of users, then all of the world gets into stupid face: “Whaaaaat just happened.”?

From ssllabs.com point the best thing you can do is start degrading grade if browser is not tolerant to 1.3 protocol and so help reduce need for protocol fallback. I know ssllabs.com does not have ultimate authority, but it may have some effect, and some other people are going to have another effect and combine effects may have positive effect too.

[1] https://i.imgur.com/rrc3EHh.png

Reply to j-mailor
Frédéric Kayser says:

August 31, 2016 at 3:00 AM

There’s a small error in the article: “at peak, with about 12% servers intolerant to TLS 1.2 and over 60% servers intolerant to TLS 2.” but it should read “intolerant to TLS 1.3”.

Reply to Frédéric
1. Ivan Ristic says:
  
  August 31, 2016 at 4:28 AM
  
  It’s fixed now. Thanks!
  
  Reply to Ivan
Jon DeGeorge says:

May 28, 2018 at 8:56 AM

Is there going to be a grading penalty for TLS 1.3 intolerant servers?
Example: https://www.ssllabs.com/ssltest/analyze.html?d=feedback.apple.com

Reply to Jon