Bulletin of the American Physical Society
2024 APS April Meeting
Wednesday–Saturday, April 3–6, 2024; Sacramento & Virtual
Session B16: Mini-Symposium: Physics Education Research Incorporating Big Data and Data MiningEducation Mini-Symposium Undergrad Friendly
|
Hide Abstracts |
Sponsoring Units: GPER Chair: Eric Burkholder, Auburn University Room: SAFE Credit Union Convention Center Ballroom B5, Floor 2 |
Wednesday, April 3, 2024 10:45AM - 11:21AM |
B16.00001: Opportunities and challenges of large language models in physics education Invited Speaker: Stefan Küchemann Recent advances in large language models (LLMs) such as ChatGPT demonstrate compelling performances in complex tasks such as writing coherent essays, reasoning, and even performing well in untrained tasks. While these models also hold a great promise for physics education, there are concerns about biases and output reliability as well as impacts on learning willingness, exam fraud, and misuse. Thus, it is crucial for their integration into physics learning and teaching to provide scientific evidence regarding their effectiveness and safety. In this contribution, we demonstrate how LLMs may support physics teachers and learners and conclude with a perspective of how AI may transform physics education. |
Wednesday, April 3, 2024 11:21AM - 11:33AM |
B16.00002: Integrating Generative AI as a Tool for Formative Feedback in Large Enrollment Physics Courses Mohamed Abdelhafez, Peter Dourmashkin, Aidan MacDonagh, Shams El-Adawy We investigated ways to integrate AI into our introductory physics classical mechanics course. We utilized ChatGPT to offer constructive written feedback to students during an optional pre-exam review activity. Given students' struggles with developing systematic problem-solving approaches and instructors' time limitations in large classes, we created prompts to connect course concepts and gather ChatGPT feedback. Data collected encompassed student responses, ChatGPT feedback, criteria provided for feedback and instructors' assessments. Through thematic analysis of feedback received from ChatGPT and instructors' assessments, our preliminary results suggest that ChatGPT can be used to provide timely constructive written feedback if provided with detailed criteria. In this talk, we present both the encouraging similarities between human instructor feedback and ChatGPT's feedback as well as some of the limitations that emerged in our study. In the future, it will be important to repeat this study with a larger sample size. This will help us confirm ways AI could potentially be incorporated in physics courses or suggest additional unforeseen subtleties about the potential effectiveness of AI as a feedback tool in physics courses. |
Wednesday, April 3, 2024 11:33AM - 11:45AM |
B16.00003: Analyzing unsupervised approaches to coding motivation of women in physics. Colin Green, Eric Brewe Machine learning, along with its applications to written language, is rapidly developing. PER researchers are curious to what extent these computational methods can be shown to have validity within qualitative research. We present our investigation of the use of an unsupervised machine learning algorithm called Latent Dirichlet Allocation in creating a coding scheme to characterize women's motivation for joining physics. We analyze over 2,000 survey responses from Conference for Undergraduate Women in Physics participants that were previously hand coded by Franklin et al. using established self efficacy and expectancy value theories. We test what motivational codes emerge from an unsupervised natural language processing approach, and compare the results from Latent Dirichlet Allocation topic modeling to the established code scheme. Comparison of the theoretical coding and the topic modeling show points of similarity but fail to fully reproduce the human motivational codes. |
Wednesday, April 3, 2024 11:45AM - 11:57AM |
B16.00004: Methods for trustworthy application of Large Language Models in PER Rebeckah Fussell, Megan Flynn, Anil Damle, Natasha G Holmes Within physics education research (PER), a growing body of literature investigates using natural language processing machine learning algorithms to apply coding schemes to student writing. The aspiration is that this form of measurement may be more efficient and consistent than similar measurements made with human analysis, allowing larger and broader data sets to be analyzed. In our work, we are harnessing recent innovations in Large Language Models (LLMs) such as BERT and LLaMA to learn complex coding scheme rules. Furthermore, we leverage methods from uncertainty quantification to help understand the trustworthiness of these measurements. In this talk, I will demonstrate a successful application of LLMs to measure experimental skills in lab notes and apply our methodology to evaluate the statistical and systematic uncertainty in this form of algorithm measurement. |
Wednesday, April 3, 2024 11:57AM - 12:09PM |
B16.00005: Predicting persistence of women in physics with machine learning Maxwell W Franklin, Eric Brewe In this work, we use a variety of machine learning tools to predict retention of women in physics. Previously, we used data collected at the Conference for Undergraduate Women in Physics, along with a follow-up survey, to study which factors correlated with long term persistence in physics. The factors we studied were sense of belonging, sense of community, interest, physics identity, perceived recognition, and performance competence. In this study, we build on our previous results by comparing the machine learning methods of support vector machines, neural networks, random forests, and logistic regression to best predict which women are most at risk of leaving the discipline. |
Wednesday, April 3, 2024 12:09PM - 12:21PM |
B16.00006: Utility of pre-instruction diagnostic tests for estimating probabilities of final course grades in introductory physics David E Meltzer, Dakota H King We have examined the relationship between various pre-instruction assessment measures and final course grades for students enrolled in introductory general physics courses at five campuses of four universities; the total sample included 25 separate classes and over 2000 students. The three assessments were the Force Concept Inventory, the Lawson Test of Scientific Reasoning, and a mathematics diagnostic test that we have developed and tested over the past seven years. We find, with nearly 90 percent consistency, that top-quartile scorers on the pre-instruction assessments have double or greater probability of receiving high (top quartile) course grades, and half or less probability of receiving low (bottom quartile) course grades, compared to students who scored in the bottom quartile on the assessments. Predictor variables have some inter-correlation but models incorporating two or more predictors generally have somewhat more predictive power than single-variable models, although the most successful sets of predictors vary from course to course. |
Follow Us |
Engage
Become an APS Member |
My APS
Renew Membership |
Information for |
About APSThe American Physical Society (APS) is a non-profit membership organization working to advance the knowledge of physics. |
© 2025 American Physical Society
| All rights reserved | Terms of Use
| Contact Us
Headquarters
1 Physics Ellipse, College Park, MD 20740-3844
(301) 209-3200
Editorial Office
100 Motor Pkwy, Suite 110, Hauppauge, NY 11788
(631) 591-4000
Office of Public Affairs
529 14th St NW, Suite 1050, Washington, D.C. 20045-2001
(202) 662-8700