BEGIN:VCALENDAR
VERSION:2.0
PRODID:-// - ECPv5.3.1//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://aarms.math.ca
X-WR-CALDESC:Events for 
BEGIN:VTIMEZONE
TZID:UTC
BEGIN:STANDARD
TZOFFSETFROM:+0000
TZOFFSETTO:+0000
TZNAME:UTC
DTSTART:20220101T000000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=UTC:20220412T110000
DTEND;TZID=UTC:20220412T120000
DTSTAMP:20260501T025601
CREATED:20220411T164445Z
LAST-MODIFIED:20220411T164445Z
UID:6644-1649761200-1649764800@aarms.math.ca
SUMMARY:AARMS Scientific Machine Learning Seminar: Michael W. Dunham (Department of Earth Sciences\, Memorial University)
DESCRIPTION:Semisupervised machine learning algorithms and their application to geoscience classification problems\n\nIn recent years\, many disciplines have been challenged with trying to efficiently extract meaning\, or value\, out of large datasets. Technological advances have improved data storage capabilities as well as how data can be obtained (e.g.\, real-time data). Manually interpreting data that are exponentially growing in volume has obvious management and analysis challenges. Machine learning is a solution to these challenges. Machine learning algorithms teach computers to recognize patterns in data and assign repetitive patterns to similar categories. This process automates pattern recognition of data and allows meaningful information to be extracted in an efficient manner.\n\n\nFor many machine learning problems\, there are sufficient data to train a wide range of algorithms. Some applications\, such as image classification and speech recognition\, have large training datasets readily available. However\, in several geoscience-related problems\, labeled data are generally obtained by sampling the earth in some manner (e.g.\, drilling wells\, field sampling\, etc.)\, which is not trivial due to cost and logistical factors. As such\, many earth science-related machine learning problems have limited training data. Supervised machine learning algorithms are prone to overfitting in scarce training data situations\, but semisupervised approaches are designed for these problems because the unlabelled data are also used to inform the learning process.\n\nThree geoscience applications inherently challenged with limited training data are well log classification\, seismic classification\, and bedrock lithology mapping. I apply various semisupervised algorithms to these three geoscience problems and determine if semisupervised algorithms can perform better than supervised methods and under what conditions\, if applicable. The semisupervised methods I consider are self-training\, label propagation\, and semisupervised Gaussian mixture models. I consider several supervised methods in my work\, but the most prevalent are gradient boosting decision tree methods (e.g.\, XGBoost\, LightGBM). The results show that semisupervised methods can outperform their supervised counterparts for each of the geoscience applications\, but there are situations where this is not always the case. Nonetheless\, semisupervised methods are rarely considered for many geoscience disciplines\, which is supported by the lack of published examples in the literature. The outcomes of this work help fill this gap\, but they also help raise the awareness of semisupervised methods.\n\n\nWebex link:\n\nhttps://mun.webex.com/mun/j.php?MTID=mf0e24b554219c531763a22ffce2e82c9
URL:https://aarms.math.ca/event/aarms-scientific-machine-learning-seminar-michael-w-dunham-department-of-earth-sciences-memorial-university/
LOCATION:WebEx seminar
CATEGORIES:AARMS Scientific Machine Learning Seminar
ORGANIZER;CN="Alexander%20Bihlo":MAILTO:abihlo@mun.ca
END:VEVENT
END:VCALENDAR