Richard Feynman once said that nobody really understands Quantum Mechanics. Today, the same can be said about Materials Informatics. The industry can be confusing, with "apples" often compared to "oranges," leading to wasted time researching and testing the wrong solutions. Mat3ra.com (formerly known as Exabyte.io) was founded before the term "Materials Informatics" even existed, and we have experienced the industry's emergence firsthand. In fact, Mat3ra has been mentioned among the key players in the industry of Materials Informatics at least four times in 2023 alone, and yet even we are still confused about what the term means! Why? Read on to find out.
In 2023, Meta, Google, and Microsoft [1-3] were among some of the main newsmakers in the rapidly growing [4,5] field of Materials Informatics
But what exactly does Materials Informatics mean, and why does it matter?
Richard Feynman once said: “I think I can safely say that nobody really understands Quantum Mechanics”. Today, it is safe to say the same about Materials Informatics. Market analysts, venture capitalists, corporate executives, startup founders, and anyone else who touches the industry operate in a confusing environment where “apples” are often compared to “oranges,” resulting in wasted time researching and/or testing the wrong solutions.
I started Mat3ra before "Materials Informatics" was a term, and I have been experiencing the emergence of the industry first-hand. In 2023, Mat3ra.com (fka Exabyte.io) has been mentioned among the key players in the industry of Materials Informatics at least four times [4], yet even I am confused about what the term means! So here’s my attempt at explaining it.
The term "Materials Informatics" gained traction around 2017, primarily through Lux Research's efforts, when they began tracking this emerging landscape. Here's a summary of the Materials Informatics timeline.
2000s: Foundational Period
2010s: Growth and Development
2020s: Rapid Advancements and Diversification
We speak about materials informatics today due to the emergence of the new paradigm of R&D - data-driven research. Previously, we could rely on (1) experimentation, (2) theory and solving analytical equations with pen and paper, and (3) computer simulations allowing us to calculate what can't be solved. Recently, we started having enough data produced by (1)-(3) to see its trends and rely on AI/ML techniques to provide scientific insights. Thus, all the Materials Informatics industry players represent multiple facets of this complex "tectonic" shift we experience and aim to facilitate this transition from an "Edisonian" approach, where we had to run thousands of expensive and slow experiments, to an "Einsteinian" approach guided by computation and AI.
Similarly, all solutions in this field touch two or more of the following three domains: (1) Materials Science and Chemistry, (2) Data Science, and (3) Computer Science.
Let's identify specific areas of R&D that are grouped into "Materials Informatics" today. To help "decompose" the problem space into domains, we suggest the following vectors:
Not to overcomplicate an already complex picture, but all the above can be further subdivided according to the type of materials concerned: Electronic Materials, Metals, Ceramics, Chemicals, Composites, Polymers, and Other.
Also, notably, item #2 above overlaps with the more established "Computer Aided Design" and "Computer Aided Engineering" industries, which existed for decades but had a relatively small level of penetration in materials and chemistry space (compared to mechanical engineering and/or electrical engineering, for example).
Before continuing, let me re-iterate that the thoughts below reflect my personal experience and understanding of the field and are not meant to accurately represent the objective state of the industry or the way that other players see themselves. The online information is often limited, especially for the high-touch/consultative solutions. That said, here's my attempt to present the Materials Informatics "playfield" and where some solutions would stand based on the above categorization.
There is no single industry of Materials Informatics. Despite the efforts of the market research professionals, instead of combining everyone into a single bucket, it would be better to "divide and conquer".
One of the challenges in understanding Materials Informatics lies in its broad application and interpretation. Currently, the term encompasses many companies and technologies which are fundamentally different. To make sense of this diversity, we need to subclassify by the following dimensions:
I. Independent Data Synthesis.
II. Black-box solutions vs. Open platforms.
III. Horizontal vs. Vertical Solutions.
The above three are not exhaustive but should provide a clearer way to distinguish between different offerings. The ability to perform data synthesis outside the AI/ML toolchain is a key differentiating factor. Many outsiders have difficulty understanding the difference because AI/ML generates data, too. However, using only AI/ML-generated data to train more AI/ML violates the "causality" principle. That's why we need input from the other layers of the figure in 3.1. In many practical cases, only physics-based simulations (maybe accelerated with AI/ML) can provide the required volume, velocity, variety, and veracity of data.
To conclude, Materials Informatics is a field of research facilitating the creation, deployment, and exchange of data-driven digital approaches involving AI/ML. At the core, it can be subdivided into two sections: (I) approaches allowing for Data Synthesis independent from resulting AI/ML, and (II) sole Data Analysis approaches.
How important is Materials Informatics? Well, everything you touch is a material. If we can improve how we discover and develop new materials by even small margins, we can directly affect many aspects of our lives, including important areas like decarbonization, reducing pollution, transition to renewable energy, and electronics beyond Moore. Materials Informatics is a pivotal field transforming how materials are discovered, developed, and applied, making it a vital area in modern science and technology.
Materials Informatics is like Wild-Wild West today - a vast, uncharted territory that has recently piqued the interest of Big Tech. It's a field ripe with potential but also complex and confusing. Today, like during the early days of Quantum Mechanics, no one truly understands what's happening. To make sense of it and to move forward faster, we need to sub-classify and draw the boundaries of the individual states of the Wild-Wild West - "California", "Arizona", "New Mexico," etc. to avoid wasted time and missed expectations. By sub-classifying and focusing our attention, we can navigate the landscape faster and pave the way for groundbreaking discoveries.
---
NOTE: this article was originally published at https://www.linkedin.com/pulse/wth-materials-informatics-timur-bazhirov-sophc