“Materials Informatics is an R&D paradigm shift; enabling discoveries and cutting the time to market”
Material Informatics could have a large impact.
(Source: Franki Chamaki (Unsplash))
Nearly every sector is proposing the use of artificial intelligence. Materials science R&D is relatively late to this trend, and there are many industry-specific hurdles, but the opportunities are beginning to be realised and the potential impact is significant.
Materials informatics is the use of data-centric approaches for materials science discovery and development. This is principally enabled by improved data infrastructures and machine learning solutions; this is set to be a paradigm shift in the way researchers conduct R&D projects and a discussion on why the adoption is now can be seen in a previous article. At this key moment of initial commercial adoption, ID Tech Ex has released the most comprehensive technical market report on the topic, ‘Materials Informatics 2020-2030’.
There are multiple potential advantages including identifying new species or relationships, extracting value from existing data, and generating use-case IP on existing compounds, but in most cases it is all about accelerating the time to market and providing a competitive advantage.
Materials informatics can be used at every stage of an experimental process.
(Source: ID Tech Ex)
Quantifying this accelerated time to market is difficult but essential for external companies to demonstrate and justify any investment. Many claim extensive examples of reducing millions of candidates and/or thousands of experiments to more manageable hundreds, or even tens, of solutions or iterations.
ID Tech Ex has classified the projects undertaken into six main categories outlined in detail within the report. Previous articles have shown how this has been used in multiple applications already.
Materials informatics can play a role at every stage of research. If this loop can be closed without human intervention it opens the opportunity for self-driving-labs. For more information see the ID Tech Ex report, ‘Materials Informatics 2020-2030’.
A key concept is the idea of an “inverse design”. In simple terms, this can involve training a model that allows properties to be input and formulations, compositions, process parameters or more to be proposed. The properties do not have to just be physical but could also be cost, toxicity, geographic availability or more. The technology is applicable to anyone that designs materials or designs with materials, an aim is to have this inverse design fully integrated with initial product design. This has been most effectively shown by the collaboration between Citrine Informatics and Siemens. It was stated that they want designers to view material as one of their “degrees of freedom” and allow materials companies to become “partners not vendors”.
For clarity, materials informatics is not to be confused with computational simulation (e.g. DFT calculations). This material modeling has seen major progressions over the past few decades (led by the likes of BIOVIA and Schrödinger) and with the continual improvement in computing power this will only increase. The announcement between JSR Corporation and QSimulate is notable recent evidence towards this. The data can be used in the same way as input data from any physical experimentation. In fact, a common approach of MI is used in reducing the number of costly and time-consuming simulations, facilitating these research projects, and drawing novel relationships.
The main problem is the limitations of the materials dataset. This is not like recognising objects in autonomous vehicles or sophisticated internet search engines, materials science brings numerous specific problems. The data is typically sparse, high-dimensional, biased, and noisy which means that understanding the uncertainty in the proposed output is essential; projecting out into the “unknown” is very challenging given the clustered, complex data.
There are many approaches to dealing with small datasets, this could involve generating one through high-throughput experimentation, leveraging external data repositories and most importantly integrating domain knowledge.
Generating and leveraging data repositories is a core theme of materials informatics. There are a wide number of very bespoke or more general repositories collecting published structures, properties, and other data. These are run by public or private organizations and, although may have limitations (such as unknown confidence in the data and biased by only having “positive” published results), they can be an unparalleled source for training models or screening for candidates. Not to mention large datasets opens the opportunity for utilizing more sophisticated deep learning methods.
Date: 08.12.2025
Naturally, we always handle your personal data responsibly. Any personal data we receive from you is processed in accordance with applicable data protection legislation. For detailed information please see our privacy policy.
Consent to the use of data for promotional purposes
I hereby consent to Vogel Communications Group GmbH & Co. KG, Max-Planck-Str. 7-9, 97082 Würzburg including any affiliated companies according to §§ 15 et seq. AktG (hereafter: Vogel Communications Group) using my e-mail address to send editorial newsletters. A list of all affiliated companies can be found here
Newsletter content may include all products and services of any companies mentioned above, including for example specialist journals and books, events and fairs as well as event-related products and services, print and digital media offers and services such as additional (editorial) newsletters, raffles, lead campaigns, market research both online and offline, specialist webportals and e-learning offers. In case my personal telephone number has also been collected, it may be used for offers of aforementioned products, for services of the companies mentioned above, and market research purposes.
Additionally, my consent also includes the processing of my email address and telephone number for data matching for marketing purposes with select advertising partners such as LinkedIn, Google, and Meta. For this, Vogel Communications Group may transmit said data in hashed form to the advertising partners who then use said data to determine whether I am also a member of the mentioned advertising partner portals. Vogel Communications Group uses this feature for the purposes of re-targeting (up-selling, cross-selling, and customer loyalty), generating so-called look-alike audiences for acquisition of new customers, and as basis for exclusion for on-going advertising campaigns. Further information can be found in section “data matching for marketing purposes”.
In case I access protected data on Internet portals of Vogel Communications Group including any affiliated companies according to §§ 15 et seq. AktG, I need to provide further data in order to register for the access to such content. In return for this free access to editorial content, my data may be used in accordance with this consent for the purposes stated here. This does not apply to data matching for marketing purposes.
Right of revocation
I understand that I can revoke my consent at will. My revocation does not change the lawfulness of data processing that was conducted based on my consent leading up to my revocation. One option to declare my revocation is to use the contact form found at https://contact.vogel.de. In case I no longer wish to receive certain newsletters, I have subscribed to, I can also click on the unsubscribe link included at the end of a newsletter. Further information regarding my right of revocation and the implementation of it as well as the consequences of my revocation can be found in the data protection declaration, section editorial newsletter.
Accelerating the time from materials design to market is essential. The material development cycle is normally far slower than many end-user products and in certain sectors this development and qualification can be the bottleneck. Materials informatics can change that.