Wednesday, March 4, 2020

Slingshotting to Exascale, It's Hot!

At the top of this episode, Henry notes that the temperature in his city will be touching -15F, which is plenty cold. However, it’s very good overclocking weather as Dan and Shahin point out. Not quite quantum weather, unfortunately.

Cray Slingshot Interconnect

We quickly get to the main topic of the day, an examination of HPE/Cray’s Slingshot interconnect. It’s Ethernet on HPC steroids and will be the interconnect of choice for their upcoming slate of Exascale systems. Slingshot includes a bunch of HPC enhancements while maintaining compatibility with existing Ethernet devices and protocols. Cray has designed a superset of Ethernet features that includes smaller headers, support for smaller message sizes, and other changes aimed at cutting Ethernet latency and improving performance on HPC-oriented interconnect tasks. At the heart of this new interconnect is their innovative 64-port switch, which provides a maximum of 200 Gb/s per port and supports Cray’s enhanced Ethernet alongside standard Ethernet traffic. It also has advanced congestion control and quality-of-service modes that ensure each job gets its fair share of bandwidth.
The architecture can scale to an astounding 279,040 endpoints, which is, as we note, “a lot of endpoints.” We also kick around the possibility that HPE/Cray might sell the interconnect as a standalone for use with competitive gear.
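
For the curious, here is a back-of-the-envelope sketch of how a 64-port switch could get to 279,040 endpoints. It assumes a dragonfly-style layout and a 16/31/17 split of the switch ports between endpoints, in-group links, and links to other groups. Neither the topology nor the split comes from the episode; they are our assumptions, and the function name is made up for illustration.

```python
# Rough endpoint arithmetic for a dragonfly-style network of 64-port switches.
# ASSUMPTION: the 16/31/17 port split is illustrative, not from the episode.
SWITCH_PORTS = 64
ENDPOINT_PORTS = 16  # assumed ports per switch wired to endpoints (NICs)
LOCAL_PORTS = 31     # assumed ports per switch wired to switches in the same group
GLOBAL_PORTS = 17    # assumed ports per switch wired to other groups


def max_endpoints(endpoint_ports: int, local_ports: int, global_ports: int) -> int:
    """Largest endpoint count when every group links directly to every other group."""
    # Each switch reaches every other switch in its group over one local link.
    switches_per_group = local_ports + 1
    # One global link per remote group, so a group can reach this many others.
    global_links_per_group = switches_per_group * global_ports
    groups = global_links_per_group + 1
    return groups * switches_per_group * endpoint_ports


if __name__ == "__main__":
    # The three port roles should use up the whole switch.
    assert ENDPOINT_PORTS + LOCAL_PORTS + GLOBAL_PORTS == SWITCH_PORTS
    print(max_endpoints(ENDPOINT_PORTS, LOCAL_PORTS, GLOBAL_PORTS))
```

Under those assumptions the arithmetic is 545 groups of 32 switches with 16 endpoints each, which lands on the same figure quoted above.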


As mentioned on the call, the chips on this switch run so hot that they need liquid cooling, a first for interconnect processors. We also discuss the rising heat load coming from new CPUs, and particularly ASICs, and how network design can greatly impact costs. Listen to the show to learn more; it’s a good and meaty discussion.

Why Nobody Should Ever be Online. Ever.

Henry’s latest reason why we need to abandon the internet cracks us all up. What’s so funny? It’s that Philips smart lightbulbs need a firmware upgrade in order to prevent miscreants from pwning your entire network. No kidding, it’s true. And hilarious. Here’s the link. This has Henry thinking about how to protect his new home from war-flying drones. He’s looking into drone-killing home-based air defense systems or perhaps a whole-home Faraday cage.

Catch of the Week



Henry:  Another security-related story, this time about low-level exploits in the Cisco Discovery Protocol (CDP) that can expose tens of millions of devices to internet troublemakers. This is highly disturbing since there is so much Cisco gear out there and the fix relies on users updating their firmware to plug the holes. Ouch.

Jessi:  Brings athletics into the podcast, which is the cause of some banter about how totally un-athletic the rest of us are (with the exception of Jessi, of course). Nike is using big-time computation to 3D print their new uppers to give athletes the ultimate advantage in shoe performance.

Shahin:  Alerts us to a comprehensive review of AMD’s Ryzen Threadripper 3990X, the first desktop CPU to sport 64 cores. This CPU is currently the top of AMD’s desktop line and is just another signpost signaling AMD’s resurgence. Welcome back, AMD.

Dan:  As we covered in a prior episode, Microsoft had the fantastic idea of forcing their corporate Office 365 users to have Microsoft’s Bing installed as their default search engine, using an update to accomplish this task. Well, the users have spoken and their voice was heard loud and clear in Redmond. The company is retreating from their forced ‘upgrade’ to Bing and backpedaling with all due speed. Hee. Hee.

Listen in to hear the full conversation

* Download the MP3
* Sign up for the insideHPC Newsletter
* Follow us on Twitter
* Subscribe on Spotify
* Subscribe on Google Play
* Subscribe on iTunes
* RSS Feed
* eMail us
