# Schools

## Density-functional theory and beyond – high-throughput screening and big-data analytics, towards exascale computational materials science

### Organisers

- Francesc Illas
*(Universitat de Barcelona, Spain)* - Eliseo Ruiz
*(Universitat de Barcelona, Spain)* - Volker Blum
*(Duke University, Durham, NC, USA, USA)* - Carsten Baldauf
*(Fritz Haber Institute of the Max Planck Society (FHI), Berlin, Germany)* - Matthias Scheffler
*(Fritz Haber Institute of the Max Planck Society (FHI), Berlin, Germany)*

### Supports

### Description

We propose a ten-day "Hands-On" school on first-principles approaches to calculate the electronic structure and relevant properties of materials, targeting students and early-career postdocs and ranging from the basics up to some of the most advanced aspects of the field. We will discuss the intrinsic and numerical accuracy, efficiency, and reproducibility of the underlying approximations with a focus on density-functional theory (DFT), but also on quantum-chemistry methods and many-body perturbation theory. Application examples will include: structure sampling for Multiscale problems (from molecules, nanoclusters to solids/surfaces), ab initio statistical mechanics, electronic and heat transport and optical properties, and high-throughput materials discovery. We want to focus as well on screening/searching approaches in materials and chemical spaces and exascale challenge problems.

The morning sessions will feature extended keynote lectures, covering the current state-of-the-art of modern electronic structure theory. In the afternoon sessions of the weekdays and on the weekend in-between, participants will deepen selected topics in tutored hands-on sessions focused on paradigmatic physical problems that can be addressed by first-principles approaches today. As the school progresses we will increasingly highlight the burgeoning role of both big data analytics and exascale computing, which are rapidly becoming essential elements of an integrated approach to modern computational materials science.

Flash presentations sessions for the participants, scheduled early in the program, will allow students, tutors, and lecturers to engage with each other and seed an open, stimulating environment for the entire school. The proposed format was highly successful in the past (roughly biannual since 1994). We expect similarly broad community interest and participant enthusiasm for the proposed 2019 workshop.

The Hands-On school includes about 30 keynote lectures (50 minutes + 10 minutes of discussion), covering the current state of modern electronic-structure theory linked to ab initio thermodynamics, statistical mechanics and materials discovery. This includes the basic concepts, but also advanced topics and techniques that go beyond standard DFT and relevant topics from neighbouring fields, e.g., quantum chemistry and GW many-body theory. The school will be given by leading and renowned general experts (e.g., Tkatchenko, Rubio, Weitao Yang, Sanvito among others). The selection of lecturers from different countries reflects the diversity of the field and also the global character of the electronic-structure community. Participants will thus not only get an overview of the field but will also be able to create and extend their professional networks to current and future key players in the field. Most of the proposed lecturers have already indicated their interest in the school and agreed to participate.

* Introduction

To set the scene, the organisers will provide a brief overview of the topics and aims of the school. Following the title of the school, Matthias Scheffler will then open the school with a motivational lecture describing the role and recent achievements of high-throughput screening and big data analytics in materials discovery. Here, we will also look forward towards the exciting new frontier of exascale computing and the promises if holds for computational materials science.

* Implementing DFT

Density-functional theory (DFT) [1,2] is undoubtedly the most successful and influential electronic-structure approach in materials science, thanks to the computational efficiency and reasonable accuracy of current density functional approximations (DFAs) for many purposes. DFAs allow for the prediction of total energy based quantities like structural, elastic, and vibrational properties of solids, often in excellent agreement with experiment. Among the many flavours of DFAs, one finds: the local-density approximation (LDA) [2,3], generalized gradient approximations (GGAs) [4-7] or meta-GGAs [8-11], optimized effective potential methods [12], van der Waals density functional theory [13,14], “generalized” schemes such as hybrid [15-17] or double-hybrid functionals [18-20] and many more. While the resulting set of theories is powerful, choosing a DFA “for the right reasons” for practical computation simulations can pose significant challenges. Lectures by Volker Blum, Weitao Yang, Will Huhn, David Vanderbilt and Sergey Levchenko, Alexandre Tkatchenko.

* Accuracy and reproducibility

Even for the same underlying DFAs, a broad ecosystem of different implementations for electronic-structure theory exists, with characteristic strengths and weaknesses. Numerical accuracy, efficiency and reproducibility of the calculations using a given DFA depend more strongly on their numerical implementation than commonly thought [21,22]. The key choice is the form of the mathematical discretization or basis set, for example, by plane waves [23], Gaussian-type orbitals [24], linearized augmented plane waves [25], Wannier-functions [26-28], numeric atom-centred orbitals [29-31], and many more options. Important other physical choices, such as the treatment of core electrons, of the electrostatic potential and its boundary conditions, of relativistic effects, etc, are tightly coupled to the chosen basis set. Obviously, the same physical results for the same question should be obtained from each of these choices if properly implemented. The question of reproducibility is thus rapidly increasing in importance, most prominently evidenced in a recent community-wide effort which highlights the materiality of this topic by comparing 15 solid-state codes, using 40 different potentials or basis set types, assessing the quality of the GGA equation of state for 71 elemental crystals [21,22]. In order to assess reproducibility and accuracy for a given set of methods benchmark data sets have also been devised. Navigating the different options is a challenge for anyone active in the field, but this is especially true for new researchers. Lectures by: Will Huhn, Igor Ying Zhang and Carsten Baldauf.

* Python-ASE

Practitioners of modern computational materials science typically employ a number of different codes, each particularly suited for a certain type of calculation. Organising the running of appropriate codes, retrieving and analysing output data produced by varied sources, and transferring data between codes can all be performed under the umbrella of the Atomic Simulation Environment (ASE). This powerful general approach to simulation is based on the user-friendly Python programming language which is thus a pre-requisite for utilising ASE. Lectures will be given by ASE developers and expert users on both the Pythonic fundamentals and on concrete useful examples. Lecturers: Ask Larssen, Krystian Thygesen, Christian Carbogno and Bjork Hammer.

* Physical Properties and Electronic Excitations: DFT and beyond

Despite the popularity of DFT, it is well-documented that commonly used (semi)local and hybrid exchange-correlation functionals are often insufficient to address specific, fundamental phenomena, including charge transfer, weak dispersion interactions and so-called strongly correlated systems [32,33]. Recent developments in computational materials science include the introduction of sophisticated quantum-chemistry methods [34-38] and many-body Green's function theory [39-41] to condensed matter physics. Furthermore, new-generation DFAs inspired by the other fields are emerging very quickly [18-20,42-45]. For many practical problems involving electronic excitations, it is necessary to go beyond ground-state theory. Green's function based many-body techniques are employed from the condensed matter physics side: most often, G0W0 based on a fixed reference [46-48], self-consistent GW approaches [49-51], the Bethe-Salpeter Equation [52] for neutral excitations (e.g., for optical properties) etc. Such methods will be compared with modern TD-DFT approaches. Angel Rubio and Miguel Marques will present the basic knowledge and applications of these methods, and will also discuss the numerical accuracy and reproducibility of these methods using well-established benchmark datasets. David Casanova will provide an overview to spin-flip methods to study excited states. Xinguo Ren will focus on both basic concepts and recent progress in quantum chemistry and many-body perturbation theory. Gemma Solomon, Christian Carbogno and Stefano Sanvito will introduce methods to tackle transport properties on complex systems.

* Time and length scales

Molecular dynamics (MD) simulations at realistic conditions (i.e. including temperature) are a primary pathway to predict the properties of real materials and molecules, for example ensemble properties or vibrational spectra. Born-Oppenheimer or Car-Parrinello MD [53] with classical nuclei and Newton's equation seem relatively simple, but feature a number of numerical challenges. These range from integration artefacts, resulting in energy drifts, to the proper derivation of statistical ensemble averages, etc. Furthermore, the approximate (re)introduction of quantum nuclear effects [54], electronic excitations through explicitly time-dependent DFT [55], or even the correlated dynamics of electrons and ions [56] can be essential to achieve physically correct results, for instance for heat transport in solids. As an extension of the more widespread thermodynamic Monte Carlo methods, the kinetic Monte Carlo (kMC) method is a useful coarse-graining tool to simulate the long-time dynamics of processes occurring in nature [57,58]. These methods together with the basic knowledge of ab initio thermodynamics will be covered in lectures by Mariana Rossi, Luca Ghiringhelli, Sergey Levchenko, Karsten Reuter and Peter Kratzer.

* Navigating materials and compound space and big-data driven materials science

A unique promise of electronic structure theory is to serve as a fully stand-alone, unbiased predictive tool of new compounds with optimized target property. In order to realize this promise, property predictions must be available for a vast space of possible compound compositions and compound geometries / topologies for solids, clusters [59], and molecules [60-62]. With the rise of the Materials’ Genome Project, high-throughput sampling of large segments of chemical or materials space has become a very active field of research [63]. Electronic-structure theory is of particular importance as a solid foundation in order to apply multiscale methods [64,65]. Much effort is spent in the field of data-driven research, for instance, “data mining” [66,67] or “machine learning” [68-70]. These approaches are ideally highly automated, relying on enormous numbers of calculations and thus will increasingly require exascale computing. However, these methods are still young and thus subject to pitfalls, for instance, outliers due to computational or even unexpected technical errors. The following topics will be covered specifically: Sampling of large conformational spaces of molecules, clusters, and solids (Scott Woodley) as well as transition path investigations (Carsten Baldauf); machine learning and big data (Stefano Curtarolo), with more general lectures given by Carlos Mera Acosta and Luca Ghiringhelli.

* The exascale frontier

Finally, we take a speculative look to the future of computational materials science with lectures covering large-scale simulations and exascale computing. The need for increasing realism in materials simulations will requires a corresponding increase in computing power. In this respect exascale computing (i.e. 10^18 operations per second) is widely regarded to be the next important frontier in large-scale detailed materials simulation. The challenges face by both hardware and software for exascale computing and the types of simulations that such capabilities will permit are covered (lecture by Nicholas Hine).

In summary, our proposed school will cover the full breadth of electronic structure based research, beginning from the fundamental concepts, and all the way to the latest development in the field, including practical examples that are paradigmatic for the science, in principle agnostic of any specific code. In the organizers' experience, it is the comprehensive focus that makes this school attractive to a large segment of researchers entering the field. We note that the main workhorse for electronic-structure tutorials in the workshop will be the FHI-aims code, with which all organizers and tutors are familiar, but stress again that this is expressly not intended to be a code-specific workshop. This philosophy is also reflected in the list of invited speakers. Active contributors to other codes for density-functional theory calculations, including Hardy Gross, Angel Rubio, Miguel Marques, David Vanderbilt, David Casanova, Stefano Sanvito, Gemma Solomon and Nicholas Hine but also force fields code with Adri van Duin.

### References

References

1. Hohenberg, P. & Kohn, W. Inhomogeneous Electron Gas. Phys. Rev. 136, B864–B871 (1964).

2. Kohn, W. & Sham, L. J. Self-Consistent Equations Including Exchange and Correlation Effects. Phys. Rev. 140, A1133–A1138 (1965).

3. Slater, J. C. A Simplification of the Hartree-Fock Method. Phys. Rev. 81, 385–390 (1951).

4. Langreth, D. C. & Mehl, M. J. Beyond the local-density approximation in calculations of ground-state electronic properties. Phys. Rev. B 28, 1809–1834 (1983).

5. Becke, A. D. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 38, 3098–3100 (1988).

6. Lee, C., Yang, W. & Parr, R. G. Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B 37, 785–789 (1988).

7. Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 77, 3865–3868 (1996).

8. Perdew, J. P., Kurth, S., Zupan, A. & Blaha, P. Accurate Density Functional with Correct Formal Properties: A Step Beyond the Generalized Gradient Approximation. Phys. Rev. Lett. 82, 2544–2547 (1999).

9. Tao, J., Perdew, J. P., Staroverov, V. N. & Scuseria, G. E. Climbing the Density Functional Ladder: Non-empirical Meta-Generalized Gradient Approximation Designed for Molecules and Solids. Phys. Rev. Lett. 91, 146401 (2003).

10. Zhao, Y. & Truhlar, D. G. A new local density functional for main-group thermochemistry, transition metal bonding, thermochemical kinetics, and non-covalent interactions. J. Chem. Phys. 125, 194101 (2006).

11. Sun, J., Ruzsinszky, A. & Perdew, J. P. Strongly Constrained and Appropriately Normed Semilocal Density Functional. Phys. Rev. Lett. 115, 36402 (2015).

12. Görling, A. New KS Method for Molecules Based on an Exchange Charge Density Generating the Exact Local KS Exchange Potential. Phys. Rev. Lett. 83, 5459–5462 (1999).

13. Dion, M., Rydberg, H., Schröder, E., Langreth, D. C. & Lundqvist, B. I. Van der Waals Density Functional for General Geometries. Phys. Rev. Lett. 92, 246401 (2004).

14. Lee, K., Murray, E. D., Kong, L., Lundqvist, B. I. & Langreth, D. C. Higher-accuracy van der Waals density functional. Phys. Rev. B 82, 081101 (2010).

15. Becke, A. D. A new mixing of Hartree–Fock and local density functional theories. J. Chem. Phys. 98, 1372–1377 (1993).

16. Perdew, J. P., Ernzerhof, M. & Burke, K. Rationale for mixing exact exchange with density functional approximations. J. Chem. Phys. 105, 9982–9985 (1996).

17. Adamo, C. & Barone, V. Toward reliable density functional methods without adjustable parameters: The PBE0 model. J. Chem. Phys. 110, 6158–6170 (1999).

18. Grimme, S. Semiempirical hybrid density functional with perturbative second-order

correlation. J. Chem. Phys. 124, 034108 (2006).

19. Zhang, Y., Xu, X. & Goddard III, W. A. Doubly hybrid density functional for accurate descriptions of non-bond interactions, thermochemistry, and thermochemical kinetics. Proc. Natl. Acad. Sci. USA 106, 4963–4968 (2009).

20. Zhang, I. Y., Xu, X., Jung, Y. & Goddard III, W. A. A fast doubly hybrid density functional method close to chemical accuracy using a local opposite spin ansatz. Proc. Natl. Acad. Sci. USA 108, 19896–19900 (2011).

21. Lejaeghere, K., Speybroeck, V. V., Oost, G. V. & Cottenier, S. Error Estimates for Solid-State Density-Functional Theory Predictions: An Overview by Means of the Ground-State Elemental Crystals. Critical Reviews in Solid State and Materials Sciences 39, 1–24 (2014).

22. K. Lejaeghere et al., Reproducibility in density functional theory calculations of solids, Science 351, aad3000 (2016).

23. Ihm, J., Zunger, A. & Cohen, M. L. Momentum-space formalism for the total energy of solids. J. Phys. C Solid State Phys. 12, 4409 (1979).

24. Szabo, A. & Ostlund, N. S. Modern Quantum Chemistry: Introduction to Advanced Electronic Structure Theory. (McGraw-Hill Publishing Company, 1989).

25. Andersen, O. K. Linear methods in band theory. Phys. Rev. B 12, 3060–3083 (1975).

26. Soler, J. M. et al. The SIESTA method for ab initio order N materials simulation. J. Phys. Condens. Matter 14, 2745 (2002).

27. Haynes, P. D., Skylaris, C.K., Mostof, A. A. & Payne, M. C. ONETEP: linear-scaling density-functional theory with plane-waves. Psi-K Newsl. 78–91 (2006).

28. Gillan, M. J., Bowler, D. R., Torralba, A. S. & Miyazaki, T. Order N first-principles calculations with the conquest code. Comput. Phys. Commun. 177, 14–18 (2007).

29. Delley, B. An all-electron numerical method for solving the local density functional for polyatomic molecules. J. Chem. Phys. 92, 508–517 (1990).

30. Koepernik, K. & Eschrig, H. Full-potential non-orthogonal local-orbital minimum-basis bandstructure scheme. Phys.Rev. B 59, 1743–1757 (1999).

31. Blum, V. et al. Ab initio molecular simulations with numeric atom-centred orbitals. Comput. Phys. Commun. 180, 2175–2196 (2009).

32. Cohen, A. J.,Mori-Sánchez, P. & Yang, W. Challenges for Density Functional Theory. Chem. Rev. 112, 289–320(2011).

33. Ruzsinszky, A. & Perdew, J. P. Twelve outstanding problems in ground-state density functional theory: A bouquet of puzzles. Comput. Theory Chem. 963, 2–6 (2011).

34. Marsman, M., Grüneis, A., Paier, J. & Kresse, G. Secondorder Møller–Plesset perturbation theory applied to extended systems. I. Within the projector-augmented-wave formalism using a plane wave basis set. J. Chem. Phys. 130, 184103 (2009).

35. Shepherd, J. & Grüneis, A. Many-Body Quantum Chemistry for the Electron Gas: Convergent Perturbative Theories. Phys. Rev. Lett. 110, 226401 (2013).

36. Booth, G. H., Grüneis, A., Kresse, G. & Alavi, A. Towards an exact description of electronic wave-functions in real solids. Nature 493, 365–370 (2013).

37. Del Ben, M., Hutter, J. & VandeV ondele, J. Second-Order Møller–Plesset Perturbation Theory in the Condensed Phase: An Efficient and Massively Parallel Gaussian and Plane Waves Approach. J. Chem. Theory Comput. 8, 4177–4188 (2012).

38. Pisani, C., et al. CRYSCOR: a program for the post-Hartree–Fock treatment of periodic systems. Phys. Chem. Chem. Phys. 14, 7615–7628 (2012).

39. Grüneis, A., Marsman, M., Harl, J., Schimka, L. & Kresse, G. Making the random phase approximation to electronic correlation accurate. J. Chem. Phys. 131, 154115 (2009).

40. Paier, J. et. al. Hybrid functionals including random phase approximation correlation and second-order screenedexchange. J. Chem. Phys. 132, 094103–094110 (2010).

41. Michaelides, A. et al. Preface: Special Topic Section on Advanced Electronic Structure Methods for Solids and Surfaces. J. Chem. Phys. 143, 102601 (2015).

42. Bates, J. E. & Furche, F. Communication: Random phase approximation renormalized many-body perturbation theory. J. Chem. Phys. 139, 171103 (2013).

43. Heßelmann, A. & Görling, A. Correct Description of the Bond Dissociation Limit without Breaking Spin Symmetry by a Random-Phase-Approximation Correlation Functional. Phys. Rev. Lett. 106 93001 (2011).

43. Scuseria, G. E., Henderson, T. M. & Bulik, I. W. Particle-particle and quasi-particle random phase approximations: Connections to coupled cluster theory. J. Chem. Phys. 139, 104113 (2013).

44. Aggelen, H. van, Yang, Y. & Yang, W. Exchange-correlation energy from pairing matrix fluctuation and the particle-particle random phase approximation. J. Chem. Phys. 140, 18A511 (2014).

45. Sharkas, K., Savin, A., Jensen, H. J. A. & Toulouse, J. A multiconfigurational hybrid density-functional theory. J. Chem. Phys. 137, 044104 (2012).

46. Aulbur, W. G., Jönsson, L. & Wilkins, J. W. in Solid State Physics (ed. SPAEPEN, H. E. and F.) 54, 1–218 (Academic Press, 1999).

47. Rinke, P., Qteish, A., Neugebauer, J., Freysoldt, C. & Scheffler, M. Combining GW calculations with exact-exchange density-functional theory: an analysis of valence-band

photoemission for compound semiconductors. New J. Phys. 7, 126 (2005).

48. Hüser, F., Olsen, T. & Thygesen, K. S. Quasiparticle GW calculations for solids, molecules, and two-dimensional materials. Phys. Rev. B 87, 235132 (2013).

49. Holm, B. & von Barth, U. Fully self-consistent GW self-energy of the electron gas. Phys. Rev. B 57, 2108–2117 (1998).

50. Stan, A., Dahlen, N. E. & Leeuwen, R. van. Time propagation of the Kadanoff–Baym equations for inhomogeneous systems. J. Chem. Phys. 130, 224101 (2009).

51. Caruso, F., Rinke, P., Ren, X., Rubio, A. & Scheffler, M. Self-consistent GW: All-electron implementation with localized basis functions. Phys. Rev. B 88, 75105 (2013).

52. Onida, G., Reining, L. & Rubio, A. Electronic excitations: density-functional versus many-body Green’s function approaches. Rev. Mod. Phys. 74, 601–659 (2002).

53. Car, R. & Parrinello, M. Unified Approach fo r Molecular Dynamics and Density-Functional Theory. Phys. Rev. Lett.55, 2471–2474 (1985).

54. Marx, D. & Parrin ello, M. Ab initio path integral molecular dynamics: Basic ideas. J. Chem. Phys. 104, 4077–4082 (1996).

55. Mar ques, M. A. L. & Gross, E. K. U. Time-Dependent Density Functional Theory. Annu. Rev. Phys. Chem. 55, 427–455 (2004).

56. Horsfiel d, A. P. et al. Correlated electron-ion dynamics in metallic systems. Comput. Mater. Sci. 44, 16–20 (2008).

57. Sanschez, J.M., Ducastelle F. & Gratias D. Generalized cluster description of multicomponent systems. Physica A 128, 334 (1984).

58. Nelson, L. J., et. al. Cluster expansion made easy with Bayesian compressive sensing. Phys. Rev. B 88, 155105 (2013).

59. Fichthorn, K. A. & Weinberg, W. H. Theoretical foundations of dynamic montecarlo simulations. J. Chem. Phys. 95, 1090–1096 (1991).

60. Reuter, K. & Scheffler, M. First-principles kinetic Monte Carlo simulations for heterogeneous catalysis: Application to the CO oxidation at RuO2 (110). Phys. Rev. B 73, 045433 (2006).

61. Bhattacharya, S., Levchenko, S. V., Ghiringhelli, L. M. & Scheffler, M. Efficient ab initio schemes for finding thermodynamically stable and metastable atomic structures: benchmark of cascade genetic algorithms. New J. Phys. 16, 123016 (2014).

62. Schubert, F. et al. Exploring the conformational preferences of 20 residue peptides in isolation: Ac-Ala19-Lys + H+vs. Ac-Lys-Ala19 + H+ and the current reach of DFT. Phys. Chem. Chem. Phys. 17, 7373–7385 (2015).

63. Supady, A., Blum, V. & Baldauf, C. First-principles molecular structure search with a genetic algorithm. J. Chem. Inf. Model. 55, 2338, 2015.

64. Ropo, M., Schneider, M., Baldauf, C. & Blum, V. First-principles data set of 45,892 isolated and cation-coordinated conformers of 20 proteinogenic amino acids. Sci. Data 3, 160009, 2016.

65. About the Materials Genome Initiative. The White House at https://www.whitehouse.gov /node/164866.

66. Reuter, K., Stampf, C. & Scheffler, M. in Handbook of Materials Modelling (ed. Yip, S.) 149–194 (Springer Netherlands, 2005). At http://link.springer.com/chapter/10.1007 /9781402032868_10.

67. Walle, A. van de & Ceder, G. Automating first-principles phase diagram calculation s. J. Phase Equilibria 23, 348–359 (2002).

68. Curtarolo, S., Morgan, D., Persson, K., Rodgers, J. & Ceder, G. Predicting Crystal Structures with Data Mining of Quantum Calculations. Phys. Rev. Lett. 91, 135503 (2003).

69. Curtarolo, S. et al. AFLOW: An automatic framework for high-throughput materials discovery. Comput. Mater. Sci. 58, 218–226 (2012).

70. Snyder, J. C., Rupp, M., Hansen, K., Müller, K.R. & Burke, K. Finding Density Functionals with Machine Learning. Phys. Rev. Lett. 108, 253002 (2012).

71. Montavon, G. et al. Machine learning of molecular electronic properties in chemical compound space. New J. Phys. 15, 095003 (2013).

72. Ghiringhelli, L. M., Vybiral, J., Levchenko, S. V., Draxl, C. & Scheffler, M. Big Data of Materials Science: Critical Role of the Descriptor. Phys. Rev. Lett. 114, 105503 (2015).