Predicting the coronavirus outbreak: How AI connects the dots to warn about disease threats – GCN.com

Posted: March 5, 2020 at 6:24 pm

Predicting the coronavirus outbreak: How AI connects the dots to warn about disease threats

Canadian artificial intelligence firmBlueDothas been in the news in recent weeks forwarning about the new coronavirusdays ahead of the official alerts from the Centers for Disease Control and Prevention and the World Health Organization. The company was able to do this by tapping different sources of information beyond official statistics about the number of cases reported.

BlueDots AI algorithm, a type of computer program that improves as it processes more data, brings together news stories in dozens of languages, reports from plant and animal disease tracking networks and airline ticketing data. The result is an algorithm thats better at simulating disease spread than algorithms that rely on public health data -- better enough to be able to predict outbreaks. The company uses the technology to predict and track infectious diseases for its government and private-sector customers.

Traditional epidemiology tracks where and when people contract a disease to identify the source of the outbreak and which populations are most at risk. AI systems like BlueDots model how diseases spread in populations, which makes it possible to predict where outbreaks will occur and forecast how far and fast diseases will spread. So while the CDC and laboratories around the world race to find cures for thenovel coronavirus, researchers are using AI to try to predict where the disease will go next and how much of an impact it might have. Both play a key role in facing the disease.

However, AI is not a silver bullet. The accuracy of AI systems is highly dependent on the amount and quality of the data they learn from. And how AI systems are designed and trained can raise ethical issues, which can be particularly troublesome when the technologies affect large swathes of a population about something as vital as public health.

Its all about the data

Traditional disease outbreak analysis looks at the location of an outbreak, the number of disease cases and the period of time -- the where, what and when -- to forecast thelikelihood of the disease spreadingin a short amount of time.

More recent efforts using AI and data science have expanded the what to include many different data sources, which makes it possible to make predictions about outbreaks. With the advent of Facebook, Twitter and other social and micro media sites, more and more data can be associated with a location and mined for knowledge about an event like an outbreak. The data can include medical worker forum discussions about unusual respiratory cases and social media posts about being out sick.

Much of this data is highly unstructured, meaning that computers cant easily understand it. The unstructured data can be in the form of news stories, flight maps, messages on social media, check ins from individuals, video and images. On the other hand, structured data, such as numbers of reported cases by location, is more tabulated and generally doesnt need as much preprocessing for computers to be able to interpret it.

Newer techniques such asdeep learningcan help make sense of unstructured data. These algorithms run on artificial neural networks, which consist of thousands of small interconnected processors, much like the neurons in the brain. The processors are arranged in layers, and data is evaluated at each layer and either discarded or passed onto the next layer. By cycling data through the layers in a feedback loop, a deep learning algorithm learns how to, for example, identify cats in YouTube videos.

Researchers teach deep learning algorithms to understand unstructured data by training them to recognize the components of particular types of items. For example, researchers can teach an algorithm to recognize a cup by training it with images of several types of handles and rims. That way it can recognize multiple types of cups, not just cups that have a particular set of characteristics.

Any AI model is only as good as the data used to train it. Too little data and theresults these disease-tracking models deliver can be skewed. Similarly, data quality is critical. It can be particularly challenging to control the quality of unstructured data, including crowd-sourced data. This requires researchers to carefully filter the data before feeding it to their models. This is perhaps one reason some researchers, includingthose at BlueDot, choose not to use social media data.

One way to assess data quality is by verifying the results of the AI models. Researchers need tocheck the output of their modelsagainst what unfolds in the real world, a process called ground truthing. Inaccurate predictions in public health, especially with false positives, can lead to mass hysteria about the spread of a disease.

Here is the original post:

Predicting the coronavirus outbreak: How AI connects the dots to warn about disease threats - GCN.com

Related Posts