Hybrid

Every molecule, everywhere, all at once

Tue Apr 22, 2025 4:00 p.m.—5:00 p.m.
Sterling Chemistry Laboratory
225 Prospect Street New Haven, CT 06511
  • Faculty
  • Staff
  • Graduate & Professional
  • Students
  • Undergraduate

Please join Yale Chemistry for a Silliman Seminar in Theory Chemistry with Ramon Miranda Quintana, Assistant Professor, from University of Florida.

Summary: Quantifying similarity is a central notion in science and data analysis, pervading everything from phylogenetic trees to the foundation of clustering. Unfortunately, despite being examined and applied for decades, traditional similarity and distance metrics have fundamental drawbacks. The key problem is that all of them are only defined over pairs of objects, so they scale quadratically when one tries to compare N objects. The present explosion in the amount of data available to us requires new ways to process information, and while some current algorithms can handle millions of points, we need alternatives applicable to billions. This is what motivated us to develop a new framework that can compare any number of objects at the same time. With this, we achieve an unprecedented linear scaling when comparing multiple objects. Here we will discuss the main properties of this formalism, along with its applications in drug design and to the analysis of Molecular Dynamics (MD) simulations. Our indices have proven to be incredibly versatile when applied to chemical space exploration and visualization, allowing us to rigorously quantify the chemical diversity of very large molecular libraries. This has led to the creation of several algorithms to sample important regions in chemical space, including a more efficient way of identifying the prevalence of activity cliffs. Additionally, our indices provide a convenient route to sample complex MD trajectories, allowing to identify representative structures very efficiently. Moreover, we can also cluster biological ensembles in a more robust way than with standard algorithms, which has led to our group’s work on MDANCE, a very flexible and efficient open-source clustering module.

For more information on research in the Miranda Quintana group: https://quintana.chem.ufl.edu/research/.

Faculty Host: Professors Tianyu Zhu and Victor Batista.

This seminar is generously sponsored by the Mrs. Hepsa Ely Silliman Memorial Fund.