Published/Veröffentlicht: Dec 17, 2018

300 – How Processors Got So Fast

Rate/Vote

(average: 4.63)

Category: podcast, podcast (en) Tags: computing, electronics 15 Comments

Guest: Lex Augusteijn Host: Markus Voelter Shownoter: Stefaan Rillaert

Have you ever wondered how the processor in your phone or computer got so much more faster than what the increase in megahertz suggests? In this episode we talk with Lex Augusteijn about superscalar processors, pipelining, speculative execution, register renaming and the like. We also discuss concerns other than speed, in particular, energy efficiency.

Introduction

00:06:00

Speed optimizations in modern processors

00:42:11

Additional concerns

01:44:44

Comments/Kommentare

15 Responses to 300 – How Processors Got So Fast

Tim says:

December 17, 2018 at 17:43

Vielen Dank für die Episode. Spannendes Thema. Ich freue mich darauf, es bei dem schlechten in Ruhe zu hören.
Stephan Schulz says:

December 18, 2018 at 00:16

Tolles Thema für das Jubiläum. Leider hänge ich noch ein paar Folgen hinten, aber ich kann es kaum erwarten!
Markus says:

December 18, 2018 at 10:08

Bei dem schlechten :-) ?
Peter says:

December 19, 2018 at 10:15

Great show (as always)! What about an episode on GPUs (history, current and future technologies, fixed vs. programmable pipelines, deep learning, graphics)?
Dietmar Petras says:

December 19, 2018 at 16:50

Sehr schöne Abhandlung zu diesem komplexen Thema. Am Ende äussert ihr euch dazu, dass das Thema Processor Design eine eigene Episode wert sei. Ich könnte dazu ggf. Kontakt zu interessanten Gesprächspartnern vermitteln.
David Mullineux says:

December 20, 2018 at 19:34

The best episode for a long time (if not ever ). Fascinating and superbly engaging. Thanks so much.
Would be great to hear something more on how storage /memory tech has a managed to keep pace with Moore’s law. e.g. how storage arrays work and SSD ?
Markus says:

December 21, 2018 at 10:47

Thanks David for the high praise :-)
Adam dorrell says:

December 23, 2018 at 10:09

Markus I agree with David. One of your best episodes. Congratulations and many thanks. Lex was a super guest, you should consider another episode with him. I’m probably same generation as Lex and have followed the evolution of chip design as an amateur. The complexities of super scalar were vague to me before but you both managed to bring them alive in a clear way. I’ve already recommended this episode to my team. Best Adam
Markus says:

December 23, 2018 at 12:21

Thanks Adam :-)
Tim says:

December 28, 2018 at 23:21

Sorry für den Tippfehler bei dem Comment von mir oben. Bei dem schlechten Wetter … war gemeint. Die Folge ist wirklich toll und ich habe viel gelernt.
Stephan says:

February 2, 2019 at 23:46

I finally found the time to listen to this. Another very good episode. I was surprised about how much my old (1994) knowledge (mostly from Hennessy/Patterson) still applies. I missed a section on forwarding of results in the pipeline (which reduces the effect of data dependencies, because it can make a result available to the next instruction before the store). On the other hand, I much better understood why a two bit counter is great for branch prediction (especially in loops – if you leave the loop, you will not loose the correct (backward) prediction, so if you re-enter it, you will still correctly predict the frequent case).
Florian Lohoff says:

February 6, 2019 at 10:49

A side note on the cache eviction/invalidation topic.

There are basically 2 concepts which intermix.

Concept 1 is that the CPU (or the chipset) does Bus snooping. So every memory write transaction from an external e.g. PCI/PCIe device to memory comes past the chipset which tells the CPU to invalidate the cache line the memory transaction touches. This was pretty common in all the PC ará machines. The problem here is that the more CPUs/Cores you get you will have more than one bus, more than one Memory Interface etc. So Intel made up some transaction protocol between the CPU cores/chipsets which only tell the others which cache lines to invalidate.

The other concept is to let the Operating System do it itself. For example the early SGI/Mips machines were of this concept. So before letting the OS start a DMA transaction from an external storage device the OS had to invalidate the Data Cache lines currently in the CPU cache (And avoid loading new cache lines while DMA was running)
This got more complicated from the OS side of things and had some problems with speculative execution as sometimes the CPUs loaded stuff from memory while speculating addresses. It sometimes mispredicted addresses and loaded memory from the DMA region. This was from Mips R10000 and upwards the case. It was only fixable with some kind of bus snooping which later machines employed

So on x86/PC style machines manually invalidating the cache is a pretty rare thing (it is necessary though). On MIPS style machines you have to do it in most of the OS drivers whenever the external peripheral touches memory directly.

Flo
Frank Berg says:

May 11, 2019 at 12:25

AWESOME episode. First of all the topic is of course super interesting, but also the way the dialog was held is very pleasant. Very nice level of detail, could have been even a little more at some point….With every answer it became evident that Lex could go on about every subtopic for hours. For me it is a extremely satisfying experience to listen to such experts digesting most complex things into something that a non-expert can understand and value.
And Markus is of course a very intelligent and quick thinking interviewer.
Prabh says:

January 17, 2020 at 07:29

Hi, I have just started following you and this was the first podcast I listened to till the end. This has been an awesome experience. I came across your podcast on spotify so I am really thankful for that.
Markus says:

January 17, 2020 at 07:34

Wow, you are the first Spotify listener I know of (there are a few more, but nobody had contacted us yet). Cool :-)