Department of Statistics


Seminars


There are no seminars to show.

»
Past seminars

Novel methods for making Māori health data relevant to local decision-making

Speaker: Tori Diamond

Affiliation: UoA

When: Wednesday, 27 August 2025, 1:00 pm to 2:00 pm

Where: 303-310

Abstract:

Inequitable population health outcomes remain a persistent challenge for Aotearoa New Zealand’s health system, with significant disparities continuing to impact Māori. Monitoring and addressing health inequities faces substantial methodological challenges, particularly at the local level where healthcare is delivered. The ‘small n problem’ – insufficient count sizes for reliable statistical estimates creates a critical issue: populations most in need of targeted healthcare are often those with the least reliable data to inform decision-makers.

This PhD project aims to develop and apply advanced Bayesian statistical methods to address small count challenges in official statistics research, focusing on improving small area estimation for Māori health outcomes in Aotearoa. Using linked administrative data from Stats NZ’s Integrated Data Infrastructure (IDI), this research uses two case studies: rheumatic fever diagnoses and COVID-19 vaccinations. These outcomes represent different aspects of the statistical problem - rheumatic fever as a rare event with small counts, and COVID-19 vaccinations portraying spatial variation for a more common outcome. Bayesian hierarchical frameworks are proposed to address the fundamental challenge of balancing local-level precision, statistical accuracy and the reality of small count sizes.

This research addresses a gap in official health statistics methods with real-life implications for small populations. The project will contribute to new knowledge about two important population health issues in Aotearoa while developing robust and enduring statistical methods. Expected contributions include creating methodological advances in Bayesian health research, practical applications for decision-makers and novel implementation of linked administrative data for health outcomes research.

Radial Basis Operator Networks

Speaker: Jason Kurz

Affiliation: University of Waikato

When: Wednesday, 20 August 2025, 11:00 am to 12:00 pm

Where: 303-310

Abstract :

Inverse problems are central to many scientific domains but are often ill-posed and difficult to solve reliably. Electrical Impedance Tomography (EIT) exemplifies this challenge: recovering internal conductivity from boundary voltage measurements is highly sensitive to noise and lacks uniqueness.

In this talk, we present recent work on Radial Basis Operator Networks (RBONs), a class of neural operator models that learn mappings between function spaces using a compact, interpretable architecture based on radial basis functions. The representation theorem underlying RBONs will be introduced as well as a description of the network structure and training process.

Before focusing on EIT, we will also show RBON's performance on several benchmark operator learning tasks, highlighting its ability to generalize across function classes and maintain low test error even out-of-distribution. These results point to RBON as a promising tool for data-driven solutions to PDE-governed systems, with broad relevance across scientific machine learning.

Jason is a lecturer at the Department of Mathematics, University of Waikato.

A unified approach to penalized likelihood covariance estimation in high dimensions

Speaker: Prof. Alberto Roverato

Affiliation: University of Padova

When: Friday, 15 August 2025, 2:00 pm to 3:00 pm

Where: 303-310

Abstract: This talk considers the problem of estimation of a covariance matrix for multivariate Gaussian data in a high dimensional setting. Existing approaches include maximum likelihood estimation under a pre-specified sparsity pattern, l_1-penalized loglikelihood optimization and ridge regularization of the sample covariance. These three approaches can be addressed in a unified way by considering the constrained optimization of an objective function that involves two suitably defined penalty terms. This unified procedure exploits the advantages of each individual approach, while bringing novelty in the combination of the three. We provide an efficient algorithm for the optimization of the regularized objective function and describe the relationship between the two penalty terms, thereby highlighting the importance of the joint application of the three methods. In particular, the sparse estimates of covariance matrices returned by the procedure are stable and accurate, both in low and high dimensional settings, and their calculation is more efficient than existing approaches under a partially known sparsity pattern. An illustration on sonar data is presented for the identification of the covariance structure among signals bounced off a certain material. The method is implemented in the publicly available R package gicf.

Alberto Roverato is a Professor in the Department of Statistical Sciences at the University of Padova.

Betting on Better Models

Speaker: Prof Mike West

Affiliation: Duke University

When: Friday, 15 August 2025, 11:00 am to 12:00 pm

Where: 303-310

Abstract:

I discuss statistical analysis with multiple– or many– candidate models defining model-specific predictions and decision recommendations. Key questions include those of how to relatively calibrate and combine such analyses for formal subjective Bayesian inference and resulting decisions. A main theme is to stress that decisions are articulated as primary: We (typically) model and forecast for reasons, but often those reasons are ignored in formal statistical model uncertainty analysis. This is visited and redressed through developments in Bayesian predictive decision synthesis (BPDS), overviewed here in the time series setting. I aim to convey ideas through applied contexts and examples in areas including financial portfolios and macroeconomic policy decisions, with indications of key aspects of the foundations and underlying theory.

Mike West is the Arts & Sciences Distinguished Professor Emeritus of Statistics & Decision Sciences, Duke University.

Assessment of vaccine safety and effectiveness using a global data network: a statistical perspective in the context of COVID-19 pandemic

Speaker: Han Lu

Affiliation: UoA

When: Monday, 21 July 2025, 12:00 pm to 1:00 pm

Where: 303-310

Abstract:

To help maximise important health, social, and economic benefits of vaccines, it is imperative that detection and risk assessment of adverse events of special interest (AESI) following vaccination is carried out as close to the occurrence of the events as possible. The estimation of background and post-vaccination incidence rates is a rapid and useful tool for the surveillance of potential vaccine-related AESI. Such comparisons have the potential to investigate early safety concerns well before a more sophisticated analysis can be conducted. One level of post-marketing vaccine safety monitoring is to investigate the association between exposures and adverse events using observational cohort studies or case-based study designs such as the self-controlled case series (SCCS) analysis. The SCCS method is derived from a Poisson model to estimate the relative incidences between defined risk and control windows during the observation period. As it only requires cases with individuals acting as their own control, all time-invariant confounders are self-controlled in the analysis. There are certain limitations when the SCCS methods are applied to real world data, especially for the COVID-19 vaccine safety monitoring when the vaccines were developed quickly during the pandemic years with multiple vaccine platforms and brands administered in different countries with mixed doses (i.e. homologous vs. heterologous schedules). The modelling strategies need to be further developed to incorporate these real-world challenges, particularly for rare AESI with small sample sizes which may only be detectable via global data network.

This research project aims to address several study objectives. First, we performed a comprehensive up-to-date literature review on developed SCCS methods and their applications in case studies since this approach was first introduced by Farrington in 1995. Second, we will develop novel SCCS methods to address multiple challenges in vaccine safety evaluation such as misclassification of adverse events and small sample size in rare AESI, develop and validate new methods using simulation studies, and apply to real-world global data. Third, we will summarise the background incidence rates of a wide range of potential AESIs based on the latest research evidence, and calculate the observed versus expected rates of AESIs following COVID-19 vaccination in New Zealand population by sex, age group, total-response ethnicity, NZ Deprivation Index (NZDep) and the Index of Multiple Deprivation (IMD) using national administrative data.

This is a PYR seminar.

Muscle CARs: Conditional Auto-regressions for Muscle Fibre-Type Data

Speaker: Tilman Davies

Affiliation: University of Otago

When: Thursday, 26 June 2025, 12:00 pm to 1:00 pm

Where: 303-310

Abstract : Researchers in physiology are interested in the spatial configuration of different fiber types across a mammalian muscle, which have long been acknowledged as reflecting muscle health and mobility. Biological imperatives that drive changes in type, about which little is known in many cases, can manifest in spatially dependent ways in the muscle. Data take the form of spatially arranged coordinates, each classified as one of $m$ possible types. Historically, studies investigating fiber-type configurations have treated `type' as a binary variable——\emph{fast} or \emph{slow}. This is a widely acknowledged over-simplification of reality, and, as we shall see, can yield misleading inference. In this talk, I show how we can apply and interpret conditional autoregressions (CARs) for Gaussian data to a binary response in a probit-style attack and, with a simple reparameterisation, extend this technique to cope with more than two fibre types. A real data set serves to provide a compelling example of Simpson’s paradox when analysed in either a 3-type or collapsed 2-type form.

Dr Davies is a Senior Lecturer in the Department of Mathematics and Statistics at the University of Otago.

https://www.otago.ac.nz/maths-and-stats/people/dr-tilman-davies

GCN-Driven Feature Selection and Prediction on Block-Wise Missing Multi-Omics Data

Speaker: Qingyu Meng

Affiliation: UoA

When: Wednesday, 11 June 2025, 10:30 am to 11:30 am

Where: 303-B05

Abstract:

Precision medicine increasingly leverages integrated multi-omics data to deliver personalized healthcare. However, real-world multi-omics datasets frequently exhibit block-wise missingness, posing challenges for biomarker identification and accurate clinical outcome prediction. This research proposes a two-stage Graph Convolutional Network (GCN) framework specifically designed to handle block-wise missing multi-omics data. The framework aims to robustly select predictive biomarkers and incorporate subtype-level relationships to enhance clinical outcome prediction. In Stage I, an iterative GCN-based feature scoring mechanism is combined with a random-walk graph fusion strategy to select informative features across omics layers, even under block-wise missingness. In Stage II, a GCN prediction model integrates subtype-guided sample similarity graphs to improve the prediction of clinical outcomes. This multi-omics GCN-based framework addresses block-wise missingness by selecting more informative and comprehensive features across omics layers. By further incorporating subtype-guided sample structure into prediction, it enhances the accuracy and reliability of clinical outcome modeling.

This is a PYR seminar.

Evaluating NLP tools designed to assist instructors with formative assessment for large-enrollment STEM classes

Speaker: Matt Beckman

Affiliation: The Pennsylvania State University

When: Wednesday, 9 April 2025, 11:00 am to 12:00 pm

Where: 303-310

Abstract:

This talk seeks to articulate the benefit of free-response tasks and timely formative assessment feedback, a roadmap for developing human-in-the-loop natural language processing (NLP) assisted feedback, and results from a pilot study establishing proof of principle. If we are to pursue Statistics and Data Science Education across disciplines, we will surely encounter both opportunity and necessity to develop scalable solutions for pedagogical best practices. Research suggests that “write-to-learn” tasks improve learning outcomes, yet constructed-response methods of formative assessment become unwieldy when class sizes grow large. In the pilot study, several short-answer tasks completed by nearly 2000 introductory tertiary statistics students were evaluated by human raters and an NLP algorithm. After briefly describing the tasks, the student contexts, the algorithm and the raters, this talk discusses the results which indicate substantial inter-rater agreement and group consensus. The talk will conclude with recent developments building upon this pilot, as well as implications for teaching and future research.

Bio: Matt Beckman is an Associate Research Professor of Statistics at Penn State University, Director of the Consortium for the Advancement of Undergraduate Statistics Education (CAUSE), and 2025 NZSA Visiting Lecturer. He is co-founder of a Statistics & Data Science Education Research Lab and affiliated faculty with the Social Science Research Institute and the Center for Socially Responsible Artificial Intelligence at Penn State. Matt’s primary research interests tend to focus on assessment and he is currently PI for the NSF-funded “Project CLASSIFIES” which investigates the use of NLP tools to assist instructors of large-enrollment classes with providing formative assessment feedback on short-answer, free response tasks.

Forecasting Multiple Time Series with Graph Convolutional Networks

Speaker: Guoping Hu

Affiliation: UoA

When: Monday, 7 April 2025, 12:00 pm to 1:00 pm

Where: 303-310

Forecasting multiple time series at different levels is often required in many situations, which is commonly known as hierarchical time series forecasting. Supply chain management is a typical application that requires demand forecasting at the store, city, or country level for decision-making. In hierarchical forecasting, top-down, bottom-up, and optimal linear combination are the most common methods. While top-down and bottom-up methods use only information from the top and bottom levels, respectively, linear combination methods use individual forecasts from all series and levels and combine them linearly, often outperforming traditional top-down and bottom-up methods. Despite this, these approaches do not directly use the explanatory information that may exist at various levels of the hierarchy. In addition to producing accurate forecasts, selecting a suitable method to generate basic and reconciled forecasts simultaneously is necessary. Prediction reconciliation involves adjusting predictions to be consistent across different levels. In this talk, we introduce a novel end-to-end hierarchical time-series forecasting framework based on deep learning that jointly optimizes forecasting and reconciliation tasks. The novel framework incorporates a spatiotemporal forecasting module based on a graph convolutional network and gated recurrent unit to improve base forecast accuracy. It employs a multilayer perceptron for forecast reconciliation to ensure hierarchical consistency, effectively utilizing hierarchical information throughout the process. This forecasting framework can utilize hierarchical information to generate accurate and consistent predictions for all the time series within the hierarchy. We evaluate the proposed methodology on two real-world large scale retail datasets. The results indicate that our method achieves superior performances on hierarchical forecasting tasks compared to state-of-the-art methods, especially in scenarios with promotional information.

This is a PYR seminar.

Practice to Research to Practice: My journey as a statistics teacher and statistics teacher educator

Speaker: Stephanie Casey

Affiliation: Eastern Michigan University

When: Wednesday, 2 April 2025, 11:00 am to 12:00 pm

Where: 303-310

ABSTRACT: In this talk, I will be sharing my journey from a high school teacher to a statistics teacher educator at the university level. My focus will be on how I’ve turned my teaching experiences as a practicing teacher into research efforts, and then the research results into products used by teacher educators to improve the preparation of teachers to teach statistics.

BIO: Dr. Stephanie Casey is a Professor of Mathematics Education at Eastern Michigan University, USA. She is a 2025 Fulbright Scholar, where she is researching students' interpretations of modern, big data visualizations in collaboration with the University of Canberra's STEM Education Research Centre (SERC). Her research focuses on the teaching and learning of data science and statistics, motivated by her experience teaching secondary mathematics for fourteen years. She has co-authored two sets of statistics teacher education curriculum materials that are widely used with preservice secondary STEM teachers throughout the United States.

Link: https://sites.google.com/site/stephaniecaseymath/


Top


Please give us your feedback or ask us a question

This message is...


My feedback or question is...


My email address is...

(Only if you need a reply)

A to Z Directory | Site map | Accessibility | Copyright | Privacy | Disclaimer | Feedback on this page