The Largest Suite of Cosmic Simulations for AI Training Is Now Free to Download; Already Spurring Discoveries – UConn Today – UConn Today

Posted: January 9, 2022 at 5:13 pm

Totaling 4,233 universe simulations, millions of galaxies and 350 terabytes of data, a new release from the CAMELS project is a treasure trove for cosmologists. CAMELS which stands for Cosmology and Astrophysics with MachinE Learning Simulations aims to use those simulations to train artificial intelligence models to decipher the universes properties.

Scientists are already using the data, which is free to download, to power new research, says project co-leader Francisco Villaescusa-Navarro, a research scientist with the Simons Foundations CMB (Cosmic Microwave Background) Analysis and Simulation group.

Villaescusa-Navarro leads the project with associate research scientists at the Flatiron Institutes Center for Computational Astrophysics (CCA) Shy Genel and Daniel Angls-Alczar, who is also a UConn Associate Professor of Physics.

Machine learning is revolutionizing many areas of science, but it requires a huge amount of data to exploit, says Angls-Alczar. The CAMELS public data release, with thousands of simulated universes covering a broad range of plausible physics, will provide the galaxy formation and cosmology communities with a unique opportunity to explore the potential of new machine-learning algorithms to solve a variety of problems.

The CAMELS team generated the simulations using code taken from the IllustrisTNG and Simba projects. The CAMELS team includes members of both projects, with Genel a part of the core team of IllustrisTNG and Angls-Alczar on the team that developed Simba.

About half of the simulations combine the physics of the cosmos with the smaller-scale physics essential for galaxy formation. Each simulation is run with slightly different assumptions about the universe for instance, regarding how much of the universe is invisible dark matter versus the dark energy pulling the cosmos apart, or how much energy supermassive black holes inject into the space between galaxies.

The researchers designed the simulations to feed machine-learning models, which will then be able to extract information from observations of the real, observable universe. With 4,233 universe simulations, CAMELS is the largest ever suite of detailed cosmological simulations designed to train machine-learning algorithms.

The data will enable new discoveries and connect cosmology with astrophysics through machine learning, says Villaescusa-Navarro. There has never been anything similar to this, with this many universe simulations.

The CAMELS dataset is already powering research projects, with a wide range of papers utilizing the data in the works.

Pablo Villanueva-Domingo of the University of Valencia in Spain led one such paper. He and his colleagues leveraged the CAMELS simulations to train an artificial intelligence model to measure the mass of our Milky Way galaxy plus its surrounding dark matter halo, and the nearby Andromeda galaxy and its halo. The measurements the first ever done using AI put our galaxys heft at 1 trillion to 2.6 trillion times the suns mass. Those estimates are roughly in line with those made by other methods, demonstrating the AI approachs accuracy.

Meanwhile, Villaescusa-Navarro headed an effort to use the CAMELS data to estimate the value of two parameters that govern the fundamental properties of the universe: what fraction of the universe is matter, and how evenly mass is distributed throughout the cosmos. First, he and his colleagues used CAMELS to generate maps such as the distribution of dark matter, gas and different properties of stars. Then, using the maps, they trained a machine-learning tool called a neural network to predict the values of the two parameters.

This is the same kind of algorithm used to tell the difference between a cat and a dog from the pixels of an image, says Genel, who co-authored the paper. The human eye cant determine how much dark matter there is in a simulation, but a neural network can do that.

The results showed the promise of leveraging CAMELS to precisely estimate such parameters in the future based on new observations of the universe, says Villaescusa-Navarro.

Its exciting to see what other new discoveries this will enable, he says.

Read more here:

The Largest Suite of Cosmic Simulations for AI Training Is Now Free to Download; Already Spurring Discoveries - UConn Today - UConn Today

Related Posts