Skip to main content

All Publications

Language-Driven Representation Learning for Robotics
Robotics | February 24, 2023

Recent work in visual representation learning for robotics demonstrates the viability of learning from large video datasets of humans performing everyday tasks. Leveraging methods such as masked autoencoding and contrastive learning, these representations exhibit strong transfer to policy learning for visuomotor control. But, robot learning encompasses a diverse set of problems beyond control including grasp affordance prediction, language-conditioned imitation learning, and intent scoring for human-robot collaboration, amongst others. First, we demonstrate that existing representations yield inconsistent results across these tasks: masked autoencoding approaches pick up on low-level spatial features at the cost of high-level semantics, while contrastive learning approaches capture the opposite. We then introduce Voltron, a framework for language-driven representation learning from human videos and associated captions. Voltron trades off language-conditioned visual reconstruction to learn low-level visual patterns, and visually-grounded language generation to encode high-level semantics. We also construct a new evaluation suite spanning five distinct robot learning problems – a unified platform for holistically evaluating visual representations for robotics. Through comprehensive, controlled experiments across all five problems, we find that Voltron’s language-driven representations outperform the prior state-of-the-art, especially on targeted problems requiring higher-level features. READ MORE

Image
Voltran framework image
Bridging scales with Machine Learning: From first principles statistical mechanics to continuum phase field computations to study order disorder transitions in LixCoO2 (Preprint)
Energy & Materials | February 17, 2023

LixTMO2 (TM=Ni, Co, Mn) forms an important family of cathode materials for Li-ion batteries, whose performance is strongly governed by Li composition-dependent crystal structure and phase stability. Here, we use LixCoO2 (LCO) as a model system to benchmark a machine learning-enabled framework for bridging scales in materials physics. We focus on two scales: (a) assemblies of thousands of atoms described by density functional theory-informed statistical mechanics, and (b) continuum phase field models to study the dynamics of order-disorder transitions in LCO. Central to the scale bridging is the rigorous, quantitatively accurate, representation of the free energy density and chemical potentials of this material system by coarsegraining formation energies for specific atomic configurations. We develop active learning workflows to train recently developed integrable deep neural networks for such high-dimensional free energy density and chemical potential functions. The resulting, first principles-informed, machine learning-enabled, phase-field computations allow us to study LCO cathodes' phase evolution in terms of temperature, morphology, charge cycling and particle size. READ MORE

Image
Flowchart outlining the data, computational methods and machine learning-enabled linkages that bridge from the atomic up to the continuum scale.
Informatics-Driven Selection of Polymers for Fuel-Cell Applications
Energy & Materials | January 4, 2023

Modern fuel cell technologies use Nafion as the material of choice for the proton exchange membrane (PEM) and as the binding material (ionomer) used to assemble the catalyst layers of the anode and cathode. These applications demand high proton conductivity as well as other requirements. For example, PEM is expected to block electrons, oxygen, and hydrogen from penetrating and diffusing while the anode/cathode ionomer should allow hydrogen/oxygen to move easily, so that they can reach the catalyst nanoparticles. Given some of the well-known limits of Nafion, such as low glass-transition temperature, the community is in the midst of an active search for Nafion replacements. In this work, we present an informatics-based scheme to search large polymer chemical spaces, which includes establishing a list of properties needed for the targeted applications, developing predictive machine-learning models for these properties, defining a search space, and using the developed models to screen the search space. Using the scheme, we have identified 60 new polymer candidates for PEM, anode ionomer, and cathode ionomer that we hope will be advanced to the next step, i.e., validating the designs through synthesis and testing. The proposed informatics scheme is generic, and it can be used to select polymers for multiple applications in the future. READ MORE

Image
infinite polymer space image
Look Both Ways: Self-supervising Driver Gaze Estimation and Road Scene Saliency
Human Interactive Driving | November 3, 2022

We present a new on-road driving dataset, called “Look Both Ways”, which contains synchronized video of both driver faces and the forward road scene, along with ground truth gaze data registered from eye tracking glasses worn by the drivers. Our dataset supports the study of methods for non-intrusively estimating a driver’s focus of attention while driving - an important application area in road safety. A key challenge is that this task requires accurate gaze estimation, but supervised appearance-based gaze estimation methods often do not transfer well to real driving datasets, and in-domain ground truth to supervise them is difficult to gather. We therefore propose a method for self-supervision of driver gaze, by taking advantage of the geometric consistency between the driver’s gaze direction and the saliency of the scene as observed by the driver. We formulate a 3D geometric learning framework to enforce this consistency, allowing the gaze model to supervise the scene saliency model, and vice versa. We implement a prototype of our method and test it with our dataset, to show that compared to a supervised approach it can yield better gaze estimation and scene saliency estimation with no additional labels. READ MORE

Image
look both ways dataset
Principles of the Battery Data Genome
Energy & Materials | October 19, 2022

Batteries are central to modern society. They are no longer just a convenience but a critical enabler of the transition to a resilient, low-carbon economy. Battery development capabilities are provided by communities spanning materials discovery, battery chemistry and electrochemistry, cell and pack design, scale-up, manufacturing, and deployments. Despite their relative maturity, data-science practices among these diverse groups are far behind the state of the art in other fields, which have demonstrated an ability to significantly improve innovation and economic impact. The negative consequences of the present paradigm include incremental improvements but few breakthroughs, significant manufacturing uncertainties, and cascading investment risks that collectively slow deployments. The primary roadblock to a battery-data-science renaissance is the requirement for large amounts of high-quality data, which are not available in the current fragmented ecosystem. Here, we identify gaps and propose principles that enable the solution by building a robust community of data hubs with standardized practices and flexible sharing options that will seed advanced tools spanning innovation to deployment. Precedents are offered that demonstrate that both public good and immense economic gains will arise from sharing valuable battery data. The proposed Battery Data Genome looks to broadly transform innovations and revolutionize their translation from research to societal impact. READ MORE

Image
image from battery data genome article
Cloth Funnels: Canonicalized-Alignment for Multi-Purpose Garment Manipulation
Robotics | October 17, 2022

Automating garment manipulation is challenging due to extremely high variability in object configurations. To reduce this intrinsic variation, we introduce the task of “canonicalized-alignment” that simplifies downstream applications by reducing the possible garment configurations. This task can be considered as “cloth state funnel” that manipulates arbitrarily configured clothing items into a predefined deformable configuration (i.e. canonicalization) at an appropriate rigid pose (i.e. alignment). In the end, the cloth items will result in a compact set of structured and highly visible configurations – which are desirable for downstream manipulation skills. To enable this task, we propose a novel canonicalized-alignment objective that effectively guides learning to avoid adverse local minima during learning. Using this objective, we learn a multi-arm, multi-primitive policy that strategically chooses between dynamic flings and quasi-static pick and place actions to achieve efficient canonicalized alignment. We evaluate this approach on a real-world ironing and folding system that relies on this learned policy as the common first step. Empirically, we demonstrate that our task-agnostic canonicalized-alignment can enable even simple manually-designed policies to work well where they were previously inadequate, thus bridging the gap between automated non-deformable manufacturing and deformable manipulation. READ MORE

Image
cloth funnels article image
Convex synthesis and verification of control-Lyapunov and barrier functions with input constraints
Human Interactive Driving | October 2, 2022

Control Lyapunov functions (CLFs) and control barrier functions (CBFs) are widely used tools for synthesizing controllers subject to stability and safety constraints. Paired with online optimization, they provide stabilizing control actions that satisfy input constraints and avoid unsafe regions of state-space. Designing CLFs and CBFs with rigorous performance guarantees is computationally challenging. To certify existence of control actions, current techniques not only design a CLF/CBF, but also a nominal controller. This can make the synthesis task more expensive, and performance estimation more conservative. In this work, we characterize polynomial CLFs/CBFs using sum-of-squares conditions, which can be directly certified using convex optimization. This yields a CLF and CBF synthesis technique that does not rely on a nominal controller. We then present algorithms for iteratively enlarging estimates of the stabilizable and safe regions. We demonstrate our algorithms on a 2D toy system, a pendulum and a quadrotor. READ MORE

Image
Control Lyapunov Functions
Strategies for Modulating the Catalytic Activity and Selectivity of Manganese Antimonates for the Oxygen Reduction Reaction
Energy & Materials | August 19, 2022

Strategies for improving the performance of nonprecious metal catalysts for the oxygen reduction reaction (ORR) can facilitate the cost-effective deployment of fuel cell devices. Electrocatalyst performance is typically improved via two approaches: increasing the number of active sites and increasing the intrinsic activity of the active site. Herein, we utilize these two methods of improving performance for MnSb2O6, which we have recently shown to be a promising ORR catalyst due to improvements in per-metal-site activity in the antimonate framework. First, electrode engineering is used to investigate the role of mass and conductive support loading in the observed ORR performance and selectivity. The apparent 2-electron selectivity is found to decrease with increases in mass and/or conductive support loading, indicating that rotating ring disk electrode studies do not necessarily measure the intrinsic selectivity of the catalyst. Second, theoretical calculations are used to identify Cr, Fe, and Ni as promising first-row transition metals for improving the intrinsic activity of MnSb2O6. Experimentally, the addition of Cr results in 3-fold and 2-fold increases in the mass and specific activities at 0.7 V vs the reversible hydrogen electrode, respectively. This enhancement is attributed to the modulation of the active site structure and Mn oxidation state with the addition of Cr. Through these studies, we gain insight into the intrinsic and extrinsic factors that govern ORR performance. READ MORE

Image
catalytic activity article image
Solving inorganic crystal structures from X-ray powder diffraction using a generative first-principles framework
Energy & Materials | August 10, 2022

X-ray powder diffraction (XRD) is a powerful structure characterization technique, but solving unknown inorganic crystal structures from powder diffraction patterns can often be labor-intensive or speculative. We introduce an Automated XRD to Structure (AXS) solution method based on the crystal symmetry obtained from XRD patterns, an efficient search of candidate structures spanning the available degrees of freedom, and density functional theory (DFT). This methodology is completely agnostic to structural prototypes and robust in solving inorganic structures of various chemistries, crystal systems, and unit cell sizes; 92% of all crystal structures were accurately determined from the simulated XRD patterns in our benchmark set. In addition, we demonstrate the efficacy of this methodology on experimental XRD patterns by solving the crystal structures of Li8Hf6O8, Li3CrO4, and LiFeO2. READ MORE

Image
Structure Candidates image
Reflections on one million compounds in the open quantum materials database (OQMD)
Energy & Materials | July 13, 2022

Density functional theory (DFT) has been widely applied in modern materials discovery and many materials databases, including the open quantum materials database (OQMD), contain large collections of calculated DFT properties of experimentally known crystal structures and hypothetical predicted compounds. Since the beginning of the OQMD in late 2010, over one million compounds have now been calculated and stored in the database, which is constantly used by worldwide researchers in advancing materials studies. The growth of the OQMD depends on project-based high-throughput DFT calculations, including structure-based projects, property-based projects, and most recently, machine-learning-based projects. Another major goal of the OQMD is to ensure the openness of its materials data to the public and the OQMD developers are constantly working with other materials databases to reach a universal querying protocol in support of the FAIR data principles. READ MORE

Image
open quantum materials database image