Simon Fraser University
Towards Trustworthy Data Science: Interpretability, Fairness and Marketplaces
We believe data science and AI will change the world. No matter how smart and powerful an AI model we can build, the ultimate testimony of the success of data science and AI is users’ trust. How can we build trustworthy data science? At the level of user-model interaction, how can we convince users that a data analytic result is trustworthy? At the level of group-wise collaboration for data science and AI, how can we ensure that the parties and their contributions are recognized fairly, and establish trust between the outcome (e.g., a model built) of the group collaboration and the external users? At the level of data science participant eco-systems, how can we effectively and efficiently connect many participants of various roles and facilitate the connection among supplies and demands of data and models?
In this talk, I will brainstorm possible directions to the above questions in the context of an end-to-end data science pipeline. To strengthen trustworthy interactions between models and users, I will advocate exact and consistent interpretation of machine learning models. Our recent results show that exact and consistent interpretations are not just theoretically feasible, but also practical even for API-based AI services. To build trust in collaboration among multiple participants in coalition, I will review some progress in ensuring fairness in federated learning, including fair assessment of contributions and fairness enforcement in collaboration outcome. Last, to address the need of trustworthy data science eco-systems, I will review some latest efforts in building data and model marketplaces and preserving fairness and privacy. Through reflection I will discuss some challenges and opportunities in building trustworthy data science for possible future work.
Jian Pei is a Professor in the School of Computing Science at Simon Fraser University. He is a well known leading researcher in the general areas of data science, big data, data mining, and database systems. His expertise is on developing effective and efficient data analysis techniques for novel data intensive applications, and transferring his research results to products and business practice. He is recognized as a Fellow of the Royal Society of Canada (Canada’s national academy), the Canadian Academy of Engineering, the Association of Computing Machinery (ACM) and the Institute of Electrical and Electronics Engineers (IEEE). He is one of the most cited authors in data mining, database systems, and information retrieval. Since 2000, he has published one textbook, two monographs and over 300 research papers in refereed journals and conferences, which have been cited extensively by others. His research has generated remarkable impact substantially beyond academia. For example, his algorithms have been adopted by industry in production and popular open source software suites. Jian Pei also demonstrated outstanding professional leadership in many academic organizations and activities. He was the editor-in-chief of the IEEE Transactions of Knowledge and Data Engineering (TKDE) in 2013-16, the chair of the Special Interest Group on Knowledge Discovery in Data (SIGKDD) of the Association for Computing Machinery (ACM) in 2017-2021, and a general co-chair or program committee co-chair of many premier conferences. He maintains a wide spectrum of industry relations with both global and local industry partners. He is an active consultant and coach for industry on enterprise data strategies, healthcare informatics, network security intelligence, computational finance, and smart retail. He received many prestigious awards, including the 2017 ACM SIGKDD Innovation Award, the 2015 ACM SIGKDD Service Award, the 2014 IEEE ICDM Research Contributions Award, the British Columbia Innovation Council 2005 Young Innovator Award, an NSERC 2008 Discovery Accelerator Supplements Award (100 awards cross the whole country), an IBM Faculty Award (2006), a KDD Best Application Paper Award (2008), an ICDE Influential Paper Award (2018), a PAKDD Best Paper Award (2014), a PAKDD Most Influential Paper Award (2009), and an IEEE Outstanding Paper Award (2007).
RIKEN Center for Advanced Intelligence Project / The University of Tokyo
Towards Robust Machine Learning: Weak Supervision, Noisy Labels, and Beyond
When machine learning systems are trained and deployed in the real world, we face various types of uncertainty. For example, training data at hand may contain insufficient information, label noise, and bias. In this talk, I will give an overview of our recent advances in robust machine learning.
Masashi Sugiyama received Doctor of Engineering in Computer Science from Tokyo Institute of Technology, Japan in 2001. Experiencing Assistant Professor and Associate Professor at Tokyo Institute of Technology, he became Professor at the University of Tokyo in 2014. Since 2016, he has been concurrently serving as Director of RIKEN Center for Advanced Intelligence Project. His research interests include theories and algorithms of machine learning. He received the Japan Academy Medal in 2017 for his series of machine learning research. He coauthored Machine Learning in Non-Stationary Environments (MIT Press, 2012), Density Ratio Estimation in Machine Learning (Cambridge University Press, 2012), Statistical Reinforcement Learning (Chapman and Hall, 2015), Introduction to Statistical Machine Learning (Morgan Kaufmann, 2015), and Machine Learning from Weak Supervision (MIT Press, to appear).
Deakin University’s Applied Artificial Intelligence Institute (A²I²)
Sample Efficient AI with Applications in Health Care and Advanced Manufacturing
From Guglielmo Marconi who developed the radio telegraph to the Wright brothers who invented flying machines, curiosity driven experimentation has powered human innovation. Such experimental optimisation remains an integral part of the Scientific Method. This time-honoured method needs a step change to accelerate scientific innovation because this iterative method quickly hits limits.
To speed-up innovation, it is imperative to expand the capability of experimental optimisation and improve its efficiency. This talk will demonstrate how sample efficient AI can be used to deliver this acceleration in experimental design. I will discuss how the methods can be applied widely, focusing on health and advanced manufacturing particularly in settings where data is scare and experimentation is expensive. In healthcare, I show how these methods can accelerate the design of clinical/health trials to efficiently determine the optimal strategy. In advanced manufacturing, I will show how it can be applied broadly from inventing new materials and alloys to accelerating industrial processes.
The second part of the talk will focus on the new machine learning innovations that have been formulated and solved to advance experimental design. These include incorporating experimental design constraints such as process constraints, transferring knowledge from pervious experiments or experimenter “hunches”, and high dimensional Bayesian optimisation so that the number of experimental control variables can be increased.
Svetha Venkatesh is an ARC Australian Laureate Fellow, Alfred Deakin Professor and a co- Director of Applied Artificial Intelligence Institute (A2I2) at Deakin University. She was elected a Fellow of the International Association of Pattern Recognition in 2004 for contributions to formulation and extraction of semantics in multimedia data, a Fellow of the Australian Academy of Technological Sciences and Engineering in 2006, and a Fellow of the Australian Academy of Science in 2021 for ground-breaking research and contributions that have had clear impact. In 2017, Professor Venkatesh was appointed an Australian Laureate Fellow, the highest individual award the Australian Research Council can bestow.
Professor Venkatesh and her team have tackled a wide range of problems of societal significance, including the critical areas of autism, security and aged care. The outcomes have impacted the community and evolved into publications, patents, tools and spin-off companies. This includes 650+ publications, three full patents, one start-up company (iCetana) and two significant products (TOBY Playpad, Virtual Observer).
Professor Venkatesh has tackled complex pattern recognition tasks by drawing inspiration and models from widely diverse disciplines, integrating them into rigorous computational models and innovative algorithms. Her main contributions have been in the development of theoretical frameworks and novel applications for analysing large scale, multimedia data. This includes development of several Bayesian parametric and non-parametric models, solving fundamental problems in processing multiple channel, multi-modal temporal and spatial data.