While machine learning has been around a long time, deep learning has taken on a life of its own lately. The reason for that has mostly to do with the increasing amounts of computing power that have become widely available, along with the burgeoning quantities of data that can be easily harvested and used to train neural networks.
The amount of computing power at people's fingertips started growing in leaps and bounds at the turn of the millennium, when graphics processing units (GPUs) began to be harnessed for nongraphical calculations, a trend that has become increasingly pervasive over the past decade. But the computing demands of deep learning have been rising even faster. This dynamic has spurred engineers to develop electronic hardware accelerators specifically targeted to deep learning, Google's Tensor Processing Unit (TPU) being a prime example.
Here, I will describe a very different approach to this problem: using optical processors to carry out neural-network calculations with photons instead of electrons. To understand how optics can serve here, you need to know a little bit about how computers currently carry out neural-network calculations. So bear with me as I outline what goes on under the hood.
Almost invariably, artificial neurons are constructed using special software running on digital electronic computers of some sort. That software provides a given neuron with multiple inputs and one output. The state of each neuron depends on the weighted sum of its inputs, to which a nonlinear function, called an activation function, is applied. The result, the output of this neuron, then becomes an input for various other neurons.
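To make that concrete, here is a minimal sketch in Python of a single artificial neuron. The particular weights, inputs, and choice of a ReLU activation are illustrative assumptions, not taken from any specific network.

```python
import numpy as np

def relu(z):
    """A common activation function: returns z where positive, 0 otherwise."""
    return np.maximum(0.0, z)

def neuron_output(inputs, weights, bias):
    """One artificial neuron: a weighted sum of its inputs plus a bias,
    passed through a nonlinear activation function."""
    weighted_sum = np.dot(weights, inputs) + bias
    return relu(weighted_sum)

# Illustrative numbers only: three inputs feeding one neuron.
x = np.array([0.5, -1.2, 3.0])
w = np.array([0.8, 0.1, -0.4])
print(neuron_output(x, w, bias=0.2))  # this output becomes an input to other neurons
```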
Reducing the energy needs of neural networks might require computing with light
For computational efficiency, these neurons are grouped into layers, with neurons connected only to neurons in adjacent layers. The benefit of arranging things that way, as opposed to allowing connections between any two neurons, is that it allows certain mathematical tricks of linear algebra to be used to speed the calculations.
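A minimal sketch shows the payoff of that arrangement (the layer sizes and ReLU activation here are illustrative assumptions): because each neuron connects only to the previous layer, all of a layer's weighted sums collapse into a single matrix-vector product.

```python
import numpy as np

def layer_forward(W, x, b):
    """One fully connected layer: every neuron's weighted sum is computed
    at once as a matrix-vector product, then the activation is applied."""
    return np.maximum(0.0, W @ x + b)   # ReLU activation, applied elementwise

x = np.random.randn(784)                # e.g., a flattened 28-by-28 image
W = np.random.randn(100, 784)           # weights: 100 neurons, 784 inputs each
b = np.zeros(100)
hidden = layer_forward(W, x, b)         # 100 outputs, which feed the next layer
```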
While they are not the whole story, these linear-algebra calculations are the most computationally demanding part of deep learning, particularly as the size of the network grows. This is true both for training (the process of determining what weights to apply to the inputs for each neuron) and for inference (when the neural network is providing the desired results).
What are these mysterious linear-algebra calculations? They aren't so complicated really. They involve operations on matrices, which are just rectangular arrays of numbers: spreadsheets, if you will, minus the descriptive column headers you might find in a typical Excel file.
This is great news because modern computer hardware has been very well optimized for matrix operations, which were the bread and butter of high-performance computing long before deep learning became popular. The relevant matrix calculations for deep learning boil down to a large number of multiply-and-accumulate operations, whereby pairs of numbers are multiplied together and their products are added up.
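As a rough sketch, assuming nothing beyond standard Python and NumPy, the following shows how multiplying two matrices reduces to many such multiply-and-accumulate operations, and that an optimized library call produces the same answer.

```python
import numpy as np

def matmul_as_macs(A, B):
    """Multiply matrices A (m x k) and B (k x n) using explicit
    multiply-and-accumulate operations."""
    m, k = A.shape
    k2, n = B.shape
    assert k == k2, "inner dimensions must match"
    C = np.zeros((m, n))
    for i in range(m):
        for j in range(n):
            acc = 0.0
            for p in range(k):
                acc += A[i, p] * B[p, j]   # one multiply-and-accumulate
            C[i, j] = acc
    return C

A = np.random.randn(4, 3)
B = np.random.randn(3, 5)
assert np.allclose(matmul_as_macs(A, B), A @ B)  # matches the optimized routine
```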
Over the years, deep learning has required an ever-growing number of these multiply-and-accumulate operations. Consider LeNet, a pioneering deep neural network designed to do image classification. In 1998 it was shown to outperform other machine-learning techniques for recognizing handwritten letters and numerals. But by 2012 AlexNet, a neural network that crunched through about 1,600 times as many multiply-and-accumulate operations as LeNet, was able to recognize thousands of different types of objects in images.
Advancing from LeNet's initial success to AlexNet required almost 11 doublings of computing performance (1,600 is roughly 2 raised to the power 10.6). During the 14 years that advance took, Moore's law provided much of the increase. The challenge has been to keep this trend going now that Moore's law is running out of steam. The usual solution is simply to throw more computing resources, along with time, money, and energy, at the problem.
As a result, training today's large neural networks often has a significant environmental footprint. One 2019 study found, for example, that training a certain deep neural network for natural-language processing produced five times the CO2 emissions typically associated with driving an automobile over its lifetime.
Improvements in digital electronic computers allowed deep learning to blossom, to be sure. But that doesn't mean that the only way to carry out neural-network calculations is with such machines. Decades ago, when digital computers were still relatively primitive, some engineers tackled difficult calculations using analog computers instead. As digital electronics improved, those analog computers fell by the wayside. But it may be time to pursue that strategy once again, in particular when the analog computations can be done optically.
It has long been known that optical fibers can support much higher data rates than electrical wires. That's why all long-haul communication lines went optical, starting in the late 1970s. Since then, optical data links have replaced copper wires for shorter and shorter spans, all the way down to rack-to-rack communication in data centers. Optical data communication is faster and uses less power. Optical computing promises the same advantages.
But there is a big difference between communicating data and computing with it. And this is where analog optical approaches hit a roadblock. Conventional computers are based on transistors, which are highly nonlinear circuit elements, meaning that their outputs aren't just proportional to their inputs, at least when used for computing. Nonlinearity is what lets transistors switch on and off, allowing them to be fashioned into logic gates. This switching is easy to accomplish with electronics, for which nonlinearities are a dime a dozen. But photons follow Maxwell's equations, which are annoyingly linear, meaning that the output of an optical device is typically proportional to its inputs.
The trick is to use the linearity of optical devices to do the one thing that deep learning relies on most: linear algebra.
To illustrate how that can be done, I'll describe here a photonic device that, when coupled to some simple analog electronics, can multiply two matrices together. Such multiplication combines the rows of one matrix with the columns of the other. More precisely, it multiplies pairs of numbers from these rows and columns and adds their products together: the multiply-and-accumulate operations I described earlier. My MIT colleagues and I published a paper about how this could be done in 2019. We're working now to build such an optical matrix multiplier.
The basic computing unit in this device is an optical element called a beam splitter. Although its makeup is in fact more complicated, you can think of it as a half-silvered mirror set at a 45-degree angle. If you send a beam of light into it from the side, the beam splitter will allow half that light to pass straight through it, while the other half is reflected from the angled mirror, causing it to bounce off at 90 degrees from the incoming beam.
Now shine a second beam of light, perpendicular to the first, into this beam splitter so that it impinges on the other side of the angled mirror. Half of this second beam will similarly be transmitted and half reflected at 90 degrees. The two output beams will combine with the two outputs from the first beam. So this beam splitter has two inputs and two outputs.
To use this device for matrix multiplication, you generate two light beams with electric-field intensities that are proportional to the two numbers you want to multiply. Let's call these field intensities x and y. Shine those two beams into the beam splitter, which will combine these two beams. This particular beam splitter does that in a way that will produce two outputs whose electric fields have values of (x + y)/√2 and (x − y)/√2.
In addition to the beam splitter, this analog multiplier requires two simple electronic components, photodetectors, to measure the two output beams. They don't measure the electric-field intensity of those beams, though. They measure the power of a beam, which is proportional to the square of its electric-field intensity.
Why is that relation important? To understand that requires some algebra, but nothing beyond what you learned in high school. Recall that when you square (x + y)/√2 you get (x² + 2xy + y²)/2. And when you square (x − y)/√2, you get (x² − 2xy + y²)/2. Subtracting the latter from the former gives 2xy.
Pause now to contemplate the significance of this simple bit of math. It means that if you encode a number as a beam of light of a certain intensity and another number as a beam of another intensity, send them through such a beam splitter, measure the two outputs with photodetectors, and negate one of the resulting electrical signals before summing them together, you will have a signal proportional to the product of your two numbers.
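A short numerical sketch may help. It follows the beam-splitter relation described above; the specific input numbers are arbitrary, and the device is treated as ideal and noiseless.

```python
import numpy as np

def optical_multiply(x, y):
    """Idealized beam-splitter multiplier: the difference of the two
    photodetector signals is proportional to the product x * y."""
    out_plus = (x + y) / np.sqrt(2.0)    # field at one output port
    out_minus = (x - y) / np.sqrt(2.0)   # field at the other output port
    power_plus = out_plus ** 2           # photodetectors measure power (field squared)
    power_minus = out_minus ** 2
    return power_plus - power_minus      # equals 2*x*y for this ideal device

x, y = 0.7, -1.3
print(optical_multiply(x, y), 2 * x * y)  # both print -1.82
```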
[Figure: Simulations of the integrated Mach-Zehnder interferometer found in Lightmatter's neural-network accelerator show three different conditions whereby light traveling in the two branches of the interferometer undergoes different relative phase shifts (0 degrees in a, 45 degrees in b, and 90 degrees in c). Image: Lightmatter]
My description has made it sound as though each of these light beams must be held steady. In fact, you can briefly pulse the light in the two input beams and measure the output pulse. Better yet, you can feed the output signal into a capacitor, which will then accumulate charge for as long as the pulse lasts. Then you can pulse the inputs again for the same duration, this time encoding two new numbers to be multiplied together. Their product adds some more charge to the capacitor. You can repeat this process as many times as you like, each time carrying out another multiply-and-accumulate operation.
Using pulsed light in this way allows you to perform many such operations in rapid-fire sequence. The most energy-intensive part of all this is reading the voltage on that capacitor, which requires an analog-to-digital converter. But you don't have to do that after each pulseyou can wait until the end of a sequence of, say, N pulses. That means that the device can perform N multiply-and-accumulate operations using the same amount of energy to read the answer whether N is small or large. Here, N corresponds to the number of neurons per layer in your neural network, which can easily number in the thousands. So this strategy uses very little energy.
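Here is a minimal sketch of that accumulate-then-read idea, again assuming ideal, noiseless components; the vector length of 1,000 is just an illustrative stand-in for the number of neurons in a layer.

```python
import numpy as np

def optical_dot_product(xs, ys):
    """Sketch of N pulsed multiply-and-accumulate operations: each pulse
    adds charge proportional to one product; the 'capacitor' sums them,
    and the accumulated value is digitized only once at the end."""
    charge = 0.0
    for x, y in zip(xs, ys):                       # one optical pulse per pair
        out_plus = (x + y) / np.sqrt(2.0)
        out_minus = (x - y) / np.sqrt(2.0)
        charge += out_plus ** 2 - out_minus ** 2   # adds 2*x*y to the capacitor
    return charge / 2.0                            # single analog-to-digital read, known scale factor

xs = np.random.randn(1000)                         # e.g., one neuron's thousand inputs
ys = np.random.randn(1000)                         # and its thousand weights
assert np.allclose(optical_dot_product(xs, ys), np.dot(xs, ys))
```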
Sometimes you can save energy on the input side of things, too. That's because the same value is often used as an input to multiple neurons. Rather than that number being converted into light multiple times, consuming energy each time, it can be transformed just once, and the light beam that is created can be split into many channels. In this way, the energy cost of input conversion is amortized over many operations.
Splitting one beam into many channels requires nothing more complicated than a lens, but lenses can be tricky to put onto a chip. So the device we are developing to perform neural-network calculations optically may well end up being a hybrid that combines highly integrated photonic chips with separate optical elements.
I've outlined here the strategy my colleagues and I have been pursuing, but there are other ways to skin an optical cat. Another promising scheme is based on something called a Mach-Zehnder interferometer, which combines two beam splitters and two fully reflecting mirrors. It, too, can be used to carry out matrix multiplication optically. Two MIT-based startups, Lightmatter and Lightelligence, are developing optical neural-network accelerators based on this approach. Lightmatter has already built a prototype that uses an optical chip it has fabricated. And the company expects to begin selling an optical accelerator board that uses that chip later this year.
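For readers curious why a Mach-Zehnder interferometer makes a useful programmable building block, here is a small numerical sketch. The beam-splitter and phase-shifter conventions are my own assumptions, and this toy model is not a simulation of Lightmatter's or Lightelligence's devices; it simply shows that the relative phase shift in one arm controls how the two input fields are mixed into the two outputs while conserving energy.

```python
import numpy as np

def mzi_transfer_matrix(theta):
    """Idealized Mach-Zehnder interferometer: a 50:50 beam splitter, a
    relative phase shift theta in one arm, then a second beam splitter."""
    bs = np.array([[1, 1j], [1j, 1]]) / np.sqrt(2.0)      # lossless 50:50 beam splitter
    phase = np.array([[np.exp(1j * theta), 0], [0, 1]])   # phase shifter in one arm
    return bs @ phase @ bs

for theta in (0.0, np.pi / 4, np.pi / 2):   # 0, 45, and 90 degrees of relative phase
    U = mzi_transfer_matrix(theta)
    assert np.allclose(U.conj().T @ U, np.eye(2))  # energy-conserving (unitary)
    print(np.round(np.abs(U) ** 2, 3))             # how power is split between outputs
```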
Another startup using optics for computing is Optalysis, which hopes to revive a rather old concept. One of the first uses of optical computing back in the 1960s was for the processing of synthetic-aperture radar data. A key part of the challenge was to apply to the measured data a mathematical operation called the Fourier transform. Digital computers of the time struggled with such things. Even now, applying the Fourier transform to large amounts of data can be computationally intensive. But a Fourier transform can be carried out optically with nothing more complicated than a lens, which for some years was how engineers processed synthetic-aperture data. Optalysis hopes to bring this approach up to date and apply it more widely.
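Purely as a digital point of comparison (this is not a model of the optical system), the operation a lens performs on an input light field in a single pass is a two-dimensional Fourier transform, which an electronic processor must grind out with on the order of N² log N multiply-and-accumulate operations for an N-by-N array.

```python
import numpy as np

# The digital stand-in for what a single lens does optically in one pass:
# a two-dimensional Fourier transform of the input field.
field = np.random.randn(1024, 1024)   # illustrative size, e.g., a patch of radar data
spectrum = np.fft.fft2(field)         # roughly N^2 * log(N) operations digitally
print(spectrum.shape)
```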
Theoretically, photonics has the potential to accelerate deep learning by several orders of magnitude.
There is also a company called Luminous, spun out of Princeton University, which is working to create spiking neural networks based on something it calls a laser neuron. Spiking neural networks more closely mimic how biological neural networks work and, like our own brains, are able to compute using very little energy. Luminous's hardware is still in the early phase of development, but the promise of combining two energy-saving approachesspiking and opticsis quite exciting.
There are, of course, still many technical challenges to be overcome. One is to improve the accuracy and dynamic range of the analog optical calculations, which are nowhere near as good as what can be achieved with digital electronics. That's because these optical processors suffer from various sources of noise and because the digital-to-analog and analog-to-digital converters used to get the data in and out are of limited accuracy. Indeed, it's difficult to imagine an optical neural network operating with more than 8 to 10 bits of precision. While 8-bit electronic deep-learning hardware exists (the Google TPU is a good example), this industry demands higher precision, especially for neural-network training.
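To get a feel for what limited precision means for these calculations, here is a rough sketch; the quantization scheme and matrix sizes are arbitrary choices of mine, and real analog noise behaves differently, but it shows the kind of error that creeps in when inputs carry only 8 bits.

```python
import numpy as np

def quantize(x, bits=8):
    """Snap values onto a signed grid with the given number of bits,
    mimicking limited analog input precision."""
    levels = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(x)) / levels
    return np.round(x / scale) * scale

rng = np.random.default_rng(0)
A = rng.standard_normal((256, 256))
B = rng.standard_normal((256, 256))
exact = A @ B
approx = quantize(A) @ quantize(B)    # 8-bit inputs, ideal accumulation
rel_error = np.linalg.norm(exact - approx) / np.linalg.norm(exact)
print(f"relative error with 8-bit inputs: {rel_error:.4f}")
```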
There is also the difficulty of integrating optical components onto a chip. Because those components are tens of micrometers in size, they can't be packed nearly as tightly as transistors, so the required chip area adds up quickly. A 2017 demonstration of this approach by MIT researchers involved a chip that was 1.5 millimeters on a side. Even the biggest chips are no larger than several square centimeters, which places limits on the sizes of matrices that can be processed in parallel this way.
There are many additional questions on the computer-architecture side that photonics researchers tend to sweep under the rug. What's clear though is that, at least theoretically, photonics has the potential to accelerate deep learning by several orders of magnitude.
Based on the technology that's currently available for the various components (optical modulators, detectors, amplifiers, analog-to-digital converters), it's reasonable to think that the energy efficiency of neural-network calculations could be made 1,000 times better than today's electronic processors. Making more aggressive assumptions about emerging optical technology, that factor might be as large as a million. And because electronic processors are power-limited, these improvements in energy efficiency will likely translate into corresponding improvements in speed.
Many of the concepts in analog optical computing are decades old. Some even predate silicon computers. Schemes for optical matrix multiplication, and even for optical neural networks, were first demonstrated in the 1970s. But this approach didn't catch on. Will this time be different? Possibly, for three reasons.
First, deep learning is genuinely useful now, not just an academic curiosity. Second, we can't rely on Moore's law alone to continue improving electronics. And finally, we have a new technology that was not available to earlier generations: integrated photonics. These factors suggest that optical neural networks will arrive for real this time, and the future of such computations may indeed be photonic.