Pierre Baldi

    QCD-jets at the LHC are described by simple physics principles. We show how super-resolution generative networks can learn the underlying structures and use them to improve the resolution of jet images. We test this approach on massless QCD-jets and on fat top-jets and find that the network reproduces their main features even without training on pure samples. In addition, we show how a slim network architecture can be constructed once we have control of the full network performance.
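    A minimal sketch of the idea above, assuming a simple upsampling convolutional generator for calorimeter jet images; the image size, filter counts, and pixel-wise loss are illustrative assumptions, not the paper's architecture.

```python
# Minimal sketch (not the paper's architecture): an upsampling convolutional
# generator that maps a coarse calorimeter jet image to a finer grid.
# Image sizes, filter counts, and the pixel-wise loss are illustrative.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_jet_superres(low_res=(20, 20, 1), upscale=2):
    inp = layers.Input(shape=low_res)
    x = layers.Conv2D(64, 3, padding="same", activation="relu")(inp)
    x = layers.Conv2D(64, 3, padding="same", activation="relu")(x)
    x = layers.UpSampling2D(size=upscale)(x)                          # coarse -> fine grid
    x = layers.Conv2D(32, 3, padding="same", activation="relu")(x)
    out = layers.Conv2D(1, 3, padding="same", activation="relu")(x)   # non-negative energy deposits
    return models.Model(inp, out)

model = build_jet_superres()
model.compile(optimizer="adam", loss="mse")   # simple pixel-wise loss as a stand-in
```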
    Sherpa is a free open-source hyperparameter optimization library for machine learning models. It is designed for problems with computationally expensive iterative function evaluations, such as the hyperparameter tuning of deep neural networks. With Sherpa, scientists can quickly optimize hyperparameters using a variety of powerful and interchangeable algorithms. Additionally, the framework makes it easy to implement custom algorithms. Sherpa can be run on either a single machine or a cluster via a grid scheduler with minimal configuration. Finally, an interactive dashboard enables users to view the progress of models as they are trained, cancel trials, and explore which hyperparameter combinations are working best. Sherpa empowers machine learning researchers by automating the tedious aspects of model tuning and providing an extensible framework for developing automated hyperparameter-tuning strategies. Its source code and documentation are available at https://github.com/LarsHH/she...
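    For illustration, a bare-bones random-search loop of the kind Sherpa automates (the real library adds interchangeable algorithms, the dashboard, and cluster scheduling); the search space and the train_and_score function are hypothetical stand-ins, not Sherpa's API.

```python
# Illustrative only: a minimal random-search loop of the kind Sherpa automates.
# `train_and_score` is a hypothetical user-supplied function, not part of Sherpa.
import random

space = {
    "learning_rate": lambda: 10 ** random.uniform(-4, -1),
    "num_units":     lambda: random.choice([32, 64, 128, 256]),
    "dropout":       lambda: random.uniform(0.0, 0.5),
}

def train_and_score(params):
    """Train a model with `params` and return a validation loss (stub)."""
    raise NotImplementedError

def random_search(n_trials=20):
    best = None
    for _ in range(n_trials):
        params = {name: sample() for name, sample in space.items()}
        score = train_and_score(params)          # expensive evaluation
        if best is None or score < best[0]:      # lower is better
            best = (score, params)
    return best
```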
    Computer-based learning systems enable interactive learning opportunities that are not possible in a traditional teaching setting. We have previously developed Reaction Explorer, an interactive tutorial system for organic chemistry, synthesis, and mechanisms at the college level. The tutorial is powered by an underlying organic chemistry expert system comprising over 1,500 reaction rules, allowing it to generate a virtually infinite collection of problems, and has been used by students at our University for the past three years. The work presented here seeks to develop novel intelligent modules to optimize and personalize student learning trajectories by monitoring each step in a student's progress and learning to propose optimal individualized problems that are at the boundary of a student's knowledge. Specifically, the system is being upgraded with modules for computer-based dynamic assessment and personalized instruction based on concepts from the theory of knowledge spaces.
    Modern therapeutic research is a time-consuming, complex, and costly process that can considerably benefit from the use of statistical machine learning techniques. In particular, using predictive models to quantify the toxicity or activity of a molecule considerably reduces the cost of the discovery and development of a new drug. We develop and study structure-based feature representations of small molecules and successfully leverage them to create predictors for several of their chemical, physical and biological properties. We address the prediction of biological activity in more depth by studying virtual high-throughput screening (vHTS), which aims at exploiting a first exploratory biological screen to learn how to rank untested compounds according to their activity against a particular target. More specifically, we present a new algorithm, the Influence Relevance Voter (IRV), particularly tailored to that problem, and show that it is preferable to state-of-the-art methods. One of the most desirable qualities of a vHTS algorithm is its ability to present the most active compounds in the very top-ranked molecules. This capacity for what is called “early recognition” allows experimentalists to focus only on a small fraction of the compounds. To properly analyze and compare virtual high-throughput screening algorithms, we develop the concentrated receiver operating characteristic (CROC) framework, an extension of the ROC framework for the quantitative evaluation, visualization, and optimization of early recognition. Finally, we develop machine learning methods for the challenging problem of reaction prediction. Inspired by human chemists, we study elementary reaction steps; in this approach reaction prediction becomes a matter of learning to rank elementary mechanisms by favorability. We do not address this task directly, but rather undertake two necessary preliminary problems. We first develop a large database of elementary mechanisms, annotated with favorability information. We then propose a feature representation of the atoms of a molecule, which we leverage to predict whether or not they belong to a site of reactivity; eventually such a classifier can be used to filter out disfavored elementary reactions.
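    A minimal sketch of the early-recognition idea behind CROC: warp the false-positive-rate axis with a monotone transform that magnifies the top of the ranking before computing an area; the exponential transform and the alpha value are illustrative choices, not necessarily the paper's exact formulation.

```python
# Sketch of the early-recognition idea behind CROC: apply a monotone transform
# that magnifies small false-positive rates before computing the area under the
# curve. The exponential transform and alpha value shown here are assumptions.
import numpy as np
from sklearn.metrics import roc_curve, auc

def croc_auc(y_true, y_score, alpha=20.0):
    fpr, tpr, _ = roc_curve(y_true, y_score)
    # Exponential magnification of the early part of the ranking.
    fpr_warped = (1.0 - np.exp(-alpha * fpr)) / (1.0 - np.exp(-alpha))
    return auc(fpr_warped, tpr)
```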
    Humans perceive light in the visible spectrum (400-700 nm). Some night vision systems use infrared light that is not perceptible to humans and the images rendered are transposed to a digital display presenting a monochromatic image in the visible spectrum. We sought to develop an imaging algorithm powered by optimized deep learning architectures whereby infrared spectral illumination of a scene could be used to predict a visible spectrum rendering of the scene as if it were perceived by a human with visible spectrum light. This would make it possible to digitally render a visible spectrum scene to humans when they are otherwise in complete “darkness” and only illuminated with infrared light. To achieve this goal, we used a monochromatic camera sensitive to visible and near infrared light to acquire an image dataset of printed images of faces under multispectral illumination spanning standard visible red (604 nm), green (529 nm) and blue (447 nm) as well as infrared wavelengths (718,...
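    A minimal sketch, under stated assumptions, of the kind of mapping described: a convolutional network from a stack of monochrome images taken under different infrared illuminations to a predicted RGB image; the channel count and layers are assumptions, not the paper's optimized architecture.

```python
# Minimal sketch (not the paper's optimized architecture): a convolutional network
# that maps a stack of monochrome images acquired under different infrared
# illuminations to a predicted visible-spectrum (RGB) image. The number of
# infrared channels and the layer sizes are assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models

N_IR_CHANNELS = 3   # one monochrome image per infrared illumination wavelength (assumed)

def build_ir_to_rgb(h=128, w=128):
    inp = layers.Input(shape=(h, w, N_IR_CHANNELS))
    x = layers.Conv2D(64, 3, padding="same", activation="relu")(inp)
    x = layers.Conv2D(64, 3, padding="same", activation="relu")(x)
    out = layers.Conv2D(3, 1, padding="same", activation="sigmoid")(x)   # RGB in [0, 1]
    return models.Model(inp, out)

model = build_ir_to_rgb()
model.compile(optimizer="adam", loss="mae")   # per-pixel loss as a simple stand-in
```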
    Motivation Accurately predicting protein secondary structure and relative solvent accessibility is important for the study of protein evolution and structure, and is an early-stage component of typical protein 3D structure prediction pipelines. Results We present a new, improved version of the SSpro/ACCpro suite of predictors for the prediction of protein secondary structure (in three and eight classes) and relative solvent accessibility. The changes include improved, TensorFlow-trained, deep learning predictors, a richer set of profile features (232 features per residue position) and sequence-only features (71 features per position), a more recent Protein Data Bank (PDB) snapshot for training, better hyperparameter tuning, and improvements made to the HOMOLpro module, which leverages structural information from protein segment homologs in the PDB. The new SSpro 6 outperforms the previous version (SSpro 5) by 3–4% in Q3 accuracy and, when used with HOMOLpro, reaches accuracy in the 95–100% r...
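    A minimal sketch of a per-residue predictor over the feature vectors described above (232 profile features per position, three secondary-structure classes); the window size and architecture are assumptions, not SSpro 6 itself.

```python
# Minimal sketch (illustrative, not the SSpro 6 architecture): a per-residue
# classifier mapping a window of profile features to one of three secondary
# structure classes. The feature count (232) comes from the abstract; the
# window size and layer sizes are assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models

N_FEATURES = 232   # profile features per residue position (from the abstract)
WINDOW = 15        # residues of context around the central position (assumed)

def build_ss3_predictor():
    inp = layers.Input(shape=(WINDOW, N_FEATURES))
    x = layers.Bidirectional(layers.LSTM(64))(inp)   # summarize the sequence window
    x = layers.Dense(128, activation="relu")(x)
    out = layers.Dense(3, activation="softmax")(x)   # helix / strand / coil
    return models.Model(inp, out)

model = build_ss3_predictor()
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
```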
    The high degree of overlap in features across multiple mental health disorders suggests the existence of common psychopathology factor(s) (p-factors) that mediate similar phenotypic presentations across distinct but related disorders. In this perspective, we argue that circadian rhythm disruption (CRD) is a common underlying p-factor that bridges across mental health disorders within their age and sex contexts. We present and analyze evidence from the literature for the critical roles circadian rhythmicity plays in regulating mental, emotional, and behavioral functions throughout the lifespan. A review of the literature shows that coarse CRD, such as sleep disruption, is prevalent in all mental health disorders at the level of etiological and pathophysiological mechanisms and clinical phenotypical manifestations. Finally, we discuss the subtle interplay of CRD with sex in relation to these disorders across different stages of life. Our perspective highlights the need to s...
    We study the effectiveness of theoretically-motivated high-level jet observables in the extreme context of jets with a large number of hard sub-jets (up to N = 8). Previous studies indicate that high-level observables are powerful, interpretable tools to probe jet substructure for N ≤ 3 hard sub-jets, but that deep neural networks trained on low-level jet constituents match or slightly exceed their performance. We extend this work for up to N = 8 hard sub-jets, using deep particle-flow networks (PFNs) and Transformer-based networks to estimate a loose upper bound on the classification performance. A fully-connected neural network operating on a standard set of high-level jet observables, 135 N-subjettiness observables and jet mass, reaches a classification accuracy of 86.90% but falls short of the PFN and Transformer models, which reach classification accuracies of 89.19% and 91.27%, respectively, suggesting that the constituent networks utilize information not captured by the set of high...
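    A sketch of the high-level baseline described above: a fully-connected network on 136 inputs (135 N-subjettiness observables plus jet mass); the layer widths and the assumption of one class per sub-jet multiplicity are illustrative.

```python
# Sketch of the high-level baseline: a fully connected network on 136 inputs
# (135 N-subjettiness observables + jet mass) classifying jets by their number
# of hard sub-jets. The depth, widths, and 8-class setup are assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models

N_INPUTS = 136    # 135 N-subjettiness observables + jet mass
N_CLASSES = 8     # one class per sub-jet multiplicity N = 1..8 (assumed)

model = models.Sequential([
    layers.Input(shape=(N_INPUTS,)),
    layers.Dense(256, activation="relu"),
    layers.Dense(256, activation="relu"),
    layers.Dense(256, activation="relu"),
    layers.Dense(N_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
```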
    Colorectal cancer (CRC) is a leading cause of mortality worldwide, and preventive screening modalities such as colonoscopy have been shown to noticeably decrease CRC incidence and mortality. Improving colonoscopy quality remains a challenging task due to limiting factors including the training levels of colonoscopists and the variability in polyp sizes, morphologies, and locations. Deep learning methods have led to state-of-the-art systems for the identification of polyps in colonoscopy videos. In this study, we show that deep learning can also be applied to the segmentation of polyps in real time, and the underlying models can be trained using mostly weakly labeled data, in the form of bounding box annotations that do not contain precise contour information. A novel dataset, Polyp-Box-Seg of 4070 colonoscopy images with polyps from over 2000 patients, is collected, and a subset of 1300 images is manually annotated with segmentation masks. A series of models is trained to evaluate v...
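    A sketch of the weak-supervision idea: convert a bounding-box annotation into a coarse binary mask that can serve as a noisy segmentation target; the mask construction shown is an illustrative assumption, not the paper's exact procedure.

```python
# Sketch of the weak-supervision idea described above: a bounding-box annotation
# is converted into a coarse binary mask, which can then serve as a noisy training
# target for a segmentation network. The mask construction and the suggested
# pixel-wise loss are illustrative assumptions, not the paper's method.
import numpy as np

def box_to_mask(height, width, box):
    """box = (x_min, y_min, x_max, y_max) in pixel coordinates."""
    mask = np.zeros((height, width), dtype=np.float32)
    x0, y0, x1, y1 = box
    mask[y0:y1, x0:x1] = 1.0          # every pixel inside the box is treated as polyp
    return mask

weak_mask = box_to_mask(256, 256, (40, 60, 120, 150))
# `weak_mask` can be paired with its frame and fed to any segmentation model
# trained with a pixel-wise loss (e.g., binary cross-entropy).
```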
    We report a method for the phase reconstruction of an ultrashort laser pulse based on the deep learning of the nonlinear spectral changes induced by self-phase modulation. The neural networks were trained on simulated pulses with random initial phases and spectra, with pulse durations between 8.5 and 65 fs. The reconstruction is valid with moderate spectral resolution and is robust to noise. The method was validated on experimental data produced from an ultrafast laser system, where near real-time phase reconstructions were performed. This method can be used in systems with known linear and nonlinear responses, even when the fluence is not known, making it ideal for difficult-to-measure beams such as the high-energy, large-aperture beams produced in petawatt systems.
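    A sketch of the kind of simulated example such a network is trained on: a pulse acquires a nonlinear temporal phase proportional to its instantaneous intensity (self-phase modulation), which reshapes its spectrum; the grid, pulse, and B-integral value are assumptions.

```python
# Sketch of a simulated training example: a pulse propagates through an
# instantaneous Kerr nonlinearity (self-phase modulation), which changes its
# spectrum. The time grid, pulse duration, and B-integral value are assumptions.
import numpy as np

N = 1024
t = np.linspace(-200e-15, 200e-15, N)                      # time grid (s)

# A transform-limited ~20 fs Gaussian field envelope, purely illustrative.
field_in = np.exp(-t**2 / (2 * (8.5e-15)**2)).astype(complex)

def spm(field, b_integral=3.0):
    """Self-phase modulation: nonlinear temporal phase proportional to intensity."""
    intensity = np.abs(field)**2
    return field * np.exp(1j * b_integral * intensity / intensity.max())

field_out = spm(field_in)
spectrum_in  = np.abs(np.fft.fftshift(np.fft.fft(field_in)))**2
spectrum_out = np.abs(np.fft.fftshift(np.fft.fft(field_out)))**2   # SPM-broadened spectrum
# A network of the kind described would map the measured spectra back to the pulse phase.
```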
    Attention plays a fundamental role in both natural and artificial intelligence systems. In deep learning, attention-based neural architectures, such as transformer architectures, are widely used to tackle problems in natural language processing and beyond. Here we investigate the fundamental building blocks of attention and their computational properties. Within the standard model of deep learning, we classify all possible fundamental building blocks of attention in terms of their source, target, and computational mechanism. We identify and study the three most important mechanisms: additive activation attention, multiplicative output attention (output gating), and multiplicative synaptic attention (synaptic gating). The gating mechanisms correspond to multiplicative extensions of the standard model and are used across all current attention-based deep learning architectures. We study their functional properties and estimate the capacity of several attentional building blocks in the case...
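    Toy single-layer versions of the three mechanisms named above; the shapes and nonlinearity are illustrative assumptions, and the exact formulations studied in the paper may differ.

```python
# Toy single-layer versions of the three attention mechanisms named above;
# shapes and the tanh nonlinearity are assumptions for illustration only.
import numpy as np

def f(x):                                   # standard-model nonlinearity (here tanh)
    return np.tanh(x)

def additive_activation_attention(W, x, a):
    # the attention signal a is added to each unit's pre-activation
    return f(W @ x + a)

def output_gating(W, x, g):
    # multiplicative output attention: a gate multiplies each unit's output
    return g * f(W @ x)

def synaptic_gating(W, x, G):
    # multiplicative synaptic attention: gates modulate the synaptic weights themselves
    return f((G * W) @ x)

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 8))     # weights of a small layer
x = rng.normal(size=8)          # input vector
a = rng.normal(size=4)          # additive attention signal (one per unit)
g = rng.normal(size=4)          # output gate (one per unit)
G = rng.normal(size=(4, 8))     # synaptic gates (one per connection)

y1, y2, y3 = additive_activation_attention(W, x, a), output_gating(W, x, g), synaptic_gating(W, x, G)
```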
    A simple way to generate a Boolean function is to take the sign of a real polynomial in n variables. Such Boolean functions are called polynomial threshold functions. How many low-degree polynomial threshold functions are there? The special case of degree d=1 was solved by Zuev in 1989, who showed that the number T(n,1) of linear threshold functions satisfies log_2 T(n,1) ≈ n^2, up to smaller order terms. However, the number of polynomial threshold functions for any higher degree, including d=2, has remained open. We settle this problem for all fixed degrees d>1, showing that log_2 T(n,d) ≈ n·binom(n, ≤d), where binom(n, ≤d) = C(n,0) + C(n,1) + ... + C(n,d) counts the monomials of degree at most d. The solution relies on connections between the theory of Boolean threshold functions, hyperplane arrangements, and random tensors. Perhaps surprisingly, it also uses a recent result of E. Abbe, A. Shpilka, and A. Wigderson on Reed-Muller codes.
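    A quick empirical illustration of the definition (not of the asymptotic result): sample random degree-1 polynomials in n = 3 variables and count the distinct sign patterns they induce on the Boolean cube; the count approaches T(3,1).

```python
# Sanity-check sketch of the definition above: sample random linear (degree-1)
# polynomials in n = 3 variables and count the distinct Boolean sign patterns
# they induce on {-1, +1}^3. With enough samples the count approaches T(3, 1),
# the total number of linear threshold functions of 3 variables.
import itertools
import numpy as np

n = 3
cube = np.array(list(itertools.product([-1, 1], repeat=n)))   # the 2^n points

def count_threshold_functions(samples=200_000, seed=0):
    rng = np.random.default_rng(seed)
    w = rng.normal(size=(samples, n))                # random coefficient vectors
    b = rng.normal(size=(samples, 1))                # random constant terms
    signs = np.sign(w @ cube.T + b)                  # sign pattern of each polynomial
    return len(np.unique(signs, axis=0))

print(count_threshold_functions())   # approaches T(3, 1)
```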
    In a physical neural system, backpropagation is faced with a number of obstacles including: the need for labeled data, the violation of the locality learning principle, the need for symmetric connections, and the lack of modularity. Tourbillon is a new architecture that addresses all these limitations. At its core, it consists of a stack of circular autoencoders followed by an output layer. The circular autoencoders are trained in self-supervised mode by recirculation algorithms and the top layer in supervised mode by stochastic gradient descent, with the option of propagating error information through the entire stack using non-symmetric connections. While the Tourbillon architecture is meant primarily to address physical constraints, and not to improve current engineering applications of deep learning, we demonstrate its viability on standard benchmark datasets including MNIST, Fashion MNIST, and CIFAR10. We show that Tourbillon can achieve comparable performance to models trained...
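    One common form of the recirculation update for a single autoencoder, given as an illustration of local, backprop-free training; this is a sketch under assumed sizes and learning rate, not necessarily the exact variant used in Tourbillon.

```python
# One common form of the recirculation update for a single autoencoder, shown as
# an illustration of local, backprop-free training; sizes, learning rate, and the
# sigmoid nonlinearity are assumptions, not necessarily Tourbillon's exact variant.
import numpy as np

rng = np.random.default_rng(0)
n_vis, n_hid, lr = 64, 16, 0.05
W = rng.normal(scale=0.1, size=(n_hid, n_vis))   # encoder weights
V = rng.normal(scale=0.1, size=(n_vis, n_hid))   # decoder weights

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def recirculation_step(x):
    global W, V
    h1    = sigmoid(W @ x)        # first pass: encode the input
    x_hat = sigmoid(V @ h1)       # reconstruct
    h2    = sigmoid(W @ x_hat)    # second pass: re-encode the reconstruction
    # Local updates: each weight only uses the activities at its two ends.
    V += lr * np.outer(x - x_hat, h1)
    W += lr * np.outer(h1 - h2, x_hat)
    return float(np.mean((x - x_hat) ** 2))

for _ in range(1000):             # toy training loop on random binary inputs
    loss = recirculation_step(rng.integers(0, 2, size=n_vis).astype(float))
```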
    Reinforcement learning algorithms can show strong variation in performance between training runs with different random seeds. In this paper we explore how this affects hyperparameter optimization when the goal is to find hyperparameter settings that perform well across random seeds. In particular, we benchmark whether it is better to explore a large quantity of hyperparameter settings by pruning bad performers, or to aim for quality of the collected results by using repetitions. For this, we consider the Successive Halving, Random Search, and Bayesian Optimization algorithms, the latter two with and without repetitions. We apply these to tuning the PPO2 algorithm on the Cartpole balancing task and the Inverted Pendulum Swing-up task. We demonstrate that pruning may negatively affect the optimization and that repeated sampling does not help in finding hyperparameter settings that perform better across random seeds. From our experiments we conclude that Bayesian optimiz...
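    A sketch of the repetition strategy examined above: score each hyperparameter setting by its average return over several seeds and hand that average to the outer optimizer; train_ppo is a hypothetical stand-in for training PPO2 with a given seed.

```python
# Sketch of the "repetitions" idea: score a hyperparameter setting by its average
# return over several random seeds and feed that average to the outer optimizer
# (random search or Bayesian optimization). `train_ppo` is a hypothetical stand-in
# for training PPO2 with a given seed, not an actual library call.
import statistics

def train_ppo(hyperparams, seed):
    """Train PPO2 with `hyperparams` and `seed`; return the final average return (stub)."""
    raise NotImplementedError

def seed_averaged_objective(hyperparams, seeds=(0, 1, 2, 3, 4)):
    returns = [train_ppo(hyperparams, seed) for seed in seeds]
    return statistics.mean(returns), statistics.stdev(returns)
```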
    Machine learning algorithms often make decisions on behalf of agents with varied and sometimes conflicting interests. In domains where agents can choose to take their own action or delegate their action to a central mediator, an open question is how mediators should take actions on behalf of delegating agents. The main existing approach uses delegating agents to punish non-delegating agents in an attempt to get all agents to delegate, which tends to be costly for all. We introduce a Pareto Mediator which aims to improve outcomes for delegating agents without making any of them worse off. Our experiments in random normal form games, a restaurant recommendation game, and a reinforcement learning sequential social dilemma show that the Pareto Mediator greatly increases social welfare. Also, even when the Pareto Mediator is based on an incorrect model of agent utility, performance gracefully degrades to the pre-intervention level, due to the individual autonomy preserved by the voluntar...
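    A toy illustration of the mediator idea for a two-player normal-form game in which both players delegate: among joint actions that leave neither delegator worse off than the default, pick the one with the highest total payoff; the payoffs and default are made up, and this is not the paper's exact algorithm.

```python
# Toy illustration of the Pareto-mediator idea for a two-player normal-form game
# in which both players delegate: among joint actions that leave neither player
# worse off than the default joint action, pick the one with the highest total
# payoff. The payoff matrices and default actions are made up for illustration.
import numpy as np

# payoffs[i, a1, a2] = payoff to player i when players choose actions (a1, a2)
payoffs = np.array([
    [[3, 0], [5, 1]],     # player 1
    [[3, 5], [0, 1]],     # player 2
])
default = (1, 1)          # the joint action the players would take on their own

def pareto_mediator(payoffs, default):
    base = payoffs[:, default[0], default[1]]
    best, best_welfare = default, base.sum()
    n1, n2 = payoffs.shape[1:]
    for a1 in range(n1):
        for a2 in range(n2):
            u = payoffs[:, a1, a2]
            if np.all(u >= base) and u.sum() > best_welfare:   # no delegator worse off
                best, best_welfare = (a1, a2), u.sum()
    return best

print(pareto_mediator(payoffs, default))   # -> (0, 0): both players improve from 1 to 3
```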
    Particle colliders are the primary experimental instruments of high-energy physics. By creating conditions that have not occurred naturally since the Big Bang, collider experiments aim to probe the most fundamental properties of matter and the universe. These costly experiments generate very large amounts of noisy data, creating important challenges and opportunities for machine learning. In this work we use deep learning to greatly improve the statistical power on three benchmark problems involving: (1) Higgs bosons; (2) supersymmetric particles; and (3) Higgs boson decay modes. This approach increases the expected discovery significance over traditional shallow methods, by 50%, 2%, and 11% respectively. In addition, we explore the use of model compression to transfer information (dark knowledge) from deep networks to shallow networks.
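    A sketch of the dark-knowledge transfer mentioned above: a shallow student network is trained to match the temperature-softened outputs of a trained deep teacher as well as the true labels; the sizes, temperature, and loss weighting are assumptions, not the paper's settings.

```python
# Sketch of the "dark knowledge" transfer mentioned above: a shallow student is
# trained to match the temperature-softened outputs of a trained deep teacher in
# addition to the true labels. Network sizes, feature count, temperature, and the
# loss weighting are assumptions, not the paper's exact settings.
import tensorflow as tf
from tensorflow.keras import layers, models

N_FEATURES, T = 28, 5.0      # example event-feature count; softening temperature (assumed)

teacher = models.Sequential([layers.Input(shape=(N_FEATURES,))] +
                            [layers.Dense(300, activation="tanh") for _ in range(5)] +
                            [layers.Dense(1)])                    # deep network (logit output)
student = models.Sequential([layers.Input(shape=(N_FEATURES,)),
                             layers.Dense(300, activation="tanh"),
                             layers.Dense(1)])                    # shallow network

def distillation_loss(x, y_true, alpha=0.5):
    """Mix of hard-label loss and soft-target loss at temperature T."""
    t_soft = tf.sigmoid(teacher(x, training=False) / T)           # teacher's softened score
    s_logit = student(x, training=True)
    hard = tf.keras.losses.binary_crossentropy(y_true, tf.sigmoid(s_logit))
    soft = tf.keras.losses.binary_crossentropy(t_soft, tf.sigmoid(s_logit / T))
    return alpha * hard + (1.0 - alpha) * soft
```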

    And 667 more