IEEE ICDM 2025 UGHS Symposium

IEEE ICDM 2025: Undergraduate and High School Symposium

Time: Nov 14 Banquet and Nov 15 Full Day, 2025

Location: Washington D.C., USA

Hosted with the IEEE International Conference on Data Mining (ICDM) 2025.

Home

We are delighted to announce the 2nd Undergraduate and High School Symposium, to be held as part of the IEEE ICDM 2025 Conference. This symposium aims to provide a platform for young researchers to showcase their innovative work in the field of data mining and related disciplines.


Why Participate in this Symposium?


Program Agenda

Nov 14: conference banquet (evening). Nov 15: full-day UGHS program.

November 14, 2025 — Conference Banquet
Time Event Location
18:00–21:00 Banquet

November 15, 2025 (Location: 1 main room + 5 additional parallel session rooms)
Time Event Speaker
09:00–10:00 Keynote (50 mins)
Location: Senate Room
NSF Program Director (TBD)
10:00–10:30 Coffee Break
Location: Capital Terrace, Congressional
10:30–11:25 Fireside Chat (50 mins) with Dr. Jure Leskovec (Stanford)
Location: Senate Room
Moderator: TBD
11:30–12:25 Panel with Parents (50 mins): Next Generation AI Education
Location: Senate Room
Panelists:
See panelists →
12:25–12:30 Poster Preparation
12:30–14:00 Poster Session (Lunch)
Location: Capital Terrace, Congressional
All Authors
14:00–15:30 Oral Presentations (12 mins × 7 papers × 6 rooms)
UH1 - 6 · Locations see details
Oral Sessions:
See sessions →
15:30–16:00 Coffee Break
Location: Capital Terrace, Congressional
16:00–17:30 Oral Presentations (12 mins × 7 papers × 6 rooms)
UH7 - 12 · Locations see details
Oral Sessions:
See sessions →
17:30 Closing (Award Announcements)
Location: Presidential Room

Oral Presentations · 14:00–15:30

← Back to agenda

UH1 — Federal A UH2 — Federal B UH3 — South American B UH4 — California UH5 — New York UH6 — Massachusetts

14:00–15:30 · Oral Presentations(12 mins × 7 papers × 6 rooms)

Session Chairs — UH1–UH6
UH1
Sanjay Madria
UH2
Chen Chen
UH3
Fang Jin
UH4
Haibing Lu
UH5
Mohammad Ali Javidian
UH6
Long Nguyen
UH1 (Room: Federal A, Session Chair: Sanjay Madria): S01287, S01286, S01285, S01242, S01241, S01282, S01281
#TimePaper IDTitleContact
114:00–14:12S01287Can Small Quantized VLMs Drive? An Experimental Evaluation of Small Quantized VLMs for Autonomous DrivingSamson Mathew
214:12–14:24S01286Data Insights into Teen Consumer Trends: From Kaggle to KnoxvilleChuanren Liu
314:24–14:36S01285An Empirical Approach Toward Understanding the Impact of Essential Oils on Alzheimer's Disease ProgressionKhalil Al-Hussaeni
414:36–14:48S01242Assessing Bias Within Diabetes Risk Prediction in Machine Learning TechniquesAyesha Faruki
514:48–15:00S01241Heatwaves and Health Risks in New York CityJie Yang
615:00–15:12S01282Interpretable Feature Mining for AI Product DesignVivian Foutz
715:12–15:24S01281QuizWhiz: An End-to-End AI-Powered Educational Platform for K-12 Intelligent Tutoring and Teaching AnalyticsMing Zhang
UH2 (Room: Federal B, Session Chair: Chen Chen): S01280, S01279, S01278, S01277, S01276, S01275, S01274
#TimePaper IDTitleContact
114:00–14:12S01280Early Wildfire Detection with UAVs using a Frame Difference MethodBrian Hong
214:12–14:24S01279FS-PREM: A Physics-Aware Framework for Predicting Port DisruptionShriraghav Ashok
314:24–14:36S01278Privacy-First Triage Classification with Open-Weight LLMs: A Chain-of-Thought Distillation ApproachZeyuan Zhao
414:36–14:48S01277DETECT: Data-Driven Evaluation of Treatments Enabled by Classification TransformersYuanheng Mao
514:48–15:00S01276SmartPharynx: A Camera-Based Smartphone System for Screening of Bacterial Pharyngitis with a Low-Shot CycleGAN and Custom CNNSrikar Kovvali
615:00–15:12S01275An Analysis of Gender-Based Differential Item Functioning in the PISA 2018 and 2022 CyclesIsabel Xiong
715:12–15:24S01274A Critical Analysis of a Multi-Input CNN Architecture for Quantum-Enhanced ForecastingAteef Mahmud
UH3 (Room: South American B, Session Chair: Fang Jin): S01273, S01271, S01269, S01267, S01264, S01263
#TimePaper IDTitleContact
114:00–14:12S01273Towards Robust Anomaly Detection in Fish Behavior: Hybrid LLM–ML Ensembles and Federated LearningQiong Cheng
214:12–14:24S01271Transforming Color Correction for Colorblindness with Hydrodynamic Modeling and Deep Learning-Based ValidationLucas Yang
314:24–14:36S01269Comparative Analysis of GraphCast and the Global Forecast System Using Real-Time Mesoscale AnalysisJonathan Yu
414:36–14:48S01267Learning for Inflation Forecasting with Dynamic Feature SpacesZakariyya Scavotto
514:48–15:00S01264Predicting Residual Cognitive Deficit Post-Ischemic Stroke: An Imbalance-Aware Machine Learning Pipeline on EHR DataSirichandana Yakkala
615:00–15:12S01263A ResNet and ViT-U-Net Hybrid Model for Accurate FVM Flooding SimulationsIsabella Cho
UH4 (Room: California, Session Chair: Haibing Lu): S01262, S01261, S01260, S01258, S01257, S01256
#TimePaper IDTitleContact
114:00–14:12S01262AI-Assisted Safe Drop Zone Identification for Human-Guided Drone DeliveriesAtharva Kakatkar
214:12–14:24S01261Adaptive Execution Scheduler for DataDios SmartDiffAryan Poduri
314:24–14:36S01260Contrastive Retrieval Augmented In-Context Learning for Medical Classification Tasks with Imbalanced DataSwarnika Joshi
414:36–14:48S01258Graph Perspective on Multi-modal Mouse Neural Data and Behavior AnalysisWenhao Hu
514:48–15:00S01257Efficient Semantic-based Video Segment Queryingziqi Zhou
615:00–15:12S01256Enhancing Radiographic Disease Detection with MetaCheX, a Context-Aware Multimodal ModelNathan He
UH5 (Room: New York, Session Chair: Mohammad Ali Javidian): S01255, S01254, S01253, S01252, S01251, S01250
#TimePaper IDTitleContact
114:00–14:12S01255Statistical Mining of Patient Reviews for Geographic Insights on Quality of Urological Care: A Pilot StudyMax Yu
214:12–14:24S01254Enhancing On-Chip Learning for RRAM Devices Through Evolutionary TheoriesXinghui Zhao
314:24–14:36S01253Application of Object Segmentation Model in Contact Angle Measurement for Hydrophobicity DeterminationJoann Xie
414:36–14:48S01252Hybrid BiLSTM-RF Framework for Lithium-ion Battery State of Health and RUL PredictionIrene Lu
514:48–15:00S01251Uncertainty Quantification in Deep Learning based Breast Cancer Diagnosis using DCE-MRI and TRAMsJerry Wang
615:00–15:12S01250MetaRef: A Generalizable Physics-Aware Refinement Framework for Metamaterial DesignAlexander Lu
UH6 (Room: Massachusetts, Session Chair: Long Nguyen): S01249, S01248, S01246, S01245, S01244, S01243, DM556
#TimePaper IDTitleContact
114:00–14:12S01249Few-Shot Learning Meets Large Language Models: Mining Medicine Interventions From RedditFang Jin
214:12–14:24S01248Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo EffectsSierra Liu
314:24–14:36S01246FinFraud-LLM: Exploring Large Language Models for Financial Fraud DetectionJohnson Chen
414:36–14:48S01245Can Reasoning LLMs Eliminate Conformity in Multi-Agent Systems?Alina Liu
514:48–15:00S01244Echo State Networks in Reservoir Computing: Foundations, Benchmarks, and Applications to Next-G Wireless CommunicationAndrew Liu
615:00–15:12S01243Interpretable Deep Learning Framework for the Diagnosis of Age-Related Macular DegenerationAnvitaa Rudharraju
715:12–15:24DM556Explainable Skill Acquisition over Time via GraphRAG-Augmented Mastery Features, Fuzzy Clustering, and Hybrid Deep ModelsQiong Cheng

Oral Presentations · 16:00–17:30

← Back to agenda

UH7 — Federal A UH8 — Federal B UH9 — South American B UH10 — California UH11 — New York UH12 — Massachusetts

16:00–17:30 · Oral Presentations(12 mins × 7 papers × 6 rooms)

Session Chairs — UH7–UH12
UH7
Yi He
UH8
Kunpeng Liu
UH9
Kanthi K Sarpatwar
UH10
Zhou Yang
UH11
Senjuti Basu Roy
UH12
Meikang Qiu
UH7 (Room: Federal A, Session Chair: Yi He): S01284, S01283, S01240, S01239, S01238, S01237, S01236
#TimePaper IDTitleContact
116:00–16:12S01284Multi-optimizer Deep&Cross at Industrial ScaleMark znidar
216:12–16:24S01283Multi-Modal Embedding Fusion for Scalable Context-First CTRMark znidar
316:24–16:36S01240A Deep Learning Approach for Reaction-Diffusion-Advection Modeling of Vegetation-Desertification PatternsAritro Chatterjee
416:36–16:48S01239Dynamics of Fencer Rating ProgressionEthan Xu
516:48–17:00S01238ESN-DAGMM: A Lightweight Framework for Unsupervised Time-Series Data Monitoring in 5G O-RAN NetworksRaymond Zhao
617:00–17:12S01237Deep Gaussian Fusion Network for Traffic PredictionZhiqian Chen
717:12–17:24S01236Quantifying Biopharma Alliance Fragility Using a Strategic Shock Risk Index (SSRI)Rhea Zhou
UH8 (Room: Federal B, Session Chair: Kunpeng Liu): S01235, S01234, S01233, S01232, S01231, S01230, S01229
#TimePaper IDTitleContact
116:00–16:12S01235Do You Know What I Mean? Testing the Prompt Robustness of an LLM-Powered IoT SystemXingguo Ding
216:12–16:24S01234Interactive 3D Spine Modeling for Enhanced Doctor-Patient Communication and Health LiteracyChristian Jin
316:24–16:36S01233AI or humans: Who understands online emotions Better?Victor Tang
416:36–16:48S01232Mining Mobile Point-of-Interest Visit Data for Socioeconomic InsightsAmy Ma
516:48–17:00S01231An Agentic Framework for Social Event Forecasting: Approaches using Causality Contextualized Chain of ThoughtAvani Thakur
617:00–17:12S01230Data-Driven Weakly-Supervised Methods Successfully Denoise Diverse Biomedical Imaging ModalitiesReeti Rout
717:12–17:24S01229Lost in Transcription: Influence of Dialect on Whisper’s PerformanceHelen Qin
UH9 (Room: South American B, Session Chair: Kanthi K Sarpatwar): DM314, S01228, S01227, S01226, S01224, S01223, S01222
#TimePaper IDTitleContact
116:00–16:12DM314Benchmarking LLMs and Distributed Approaches for Anomaly DetectionQiong Cheng
216:12–16:24S01228Grounded Chest X-Ray Reasoning: Leveraging Visual Tools to Improve Medical Multimodal LLMsHuaxiu Yao
316:24–16:36S01227Bringing Optimization to Everyone: Exploring LLMs as a Tool for Non-Experts and StudentsWinston Zhang
416:36–16:48S01226Deep Learning to Denoise and Segment Air Pollutant PlumesDaniel Li
516:48–17:00S01224EEG EyeNet: Strong, Insightful Baselines for Eye-Movement Prediction from EEGChristian Jin
617:00–17:12S01223Automated Analysis of Astrocyte Cell Connectivity after Laser Ablation Using Machine Learning and Path-Finding AlgorithmsConnor Lee
717:12–17:24S01222Predicting Vertical Cloud Type Structure with GOES-ABI Multi-Channel Data (Deep Learning & Foundation Model)Sidh Jaddu
UH10 (Room: California, Session Chair: Zhou Yang): S01221, S01220, S01219, S01218, S01217, S01216
#TimePaper IDTitleContact
116:00–16:12S01221Multimodal Foundation Models as Router Models for High-Resolution Aerial Image SegmentationCooper Li
216:12–16:24S01220Forecasting U.S. Recessions with Machine Learning: Evidence from Ten Economic Indicators, 1978–2025Neel Dhuruva
316:24–16:36S01219C-Reactive Protein Induces Endothelial Cell Dysfunction and Replication StressJay Peng
416:36–16:48S01218Systematic Comparison of Artifact Removal Techniques for Reliable Feature Extraction from scTS-Contaminated EMGVivian Li
516:48–17:00S01217Graph-LLM for EHRs: Combining Temporal Graph Representations and LLM-Based Note Imputation for Clinical PredictionsMichael Liu
617:00–17:12S01216LLMSeqRec: LLM Enhanced Contextual Sequential RecommenderConnor Lee
UH11 (Room: New York, Session Chair: Senjuti Basu Roy): S01215, S01214, S01213, S01212, S01211, S01210
#TimePaper IDTitleContact
116:00–16:12S01215Benchmarking the Code Generation Capabilities of Popular Large Language Models for Front-End Web DevelopmentDron Datta
216:12–16:24S01214Persona-Driven LLM Interaction in Stock Market SimulationsMedhashree Parhy
316:24–16:36S01213Evaluating the Effectiveness of Persona Simulation in Opinion Prediction with GPT-4.1Sarah Li
416:36–16:48S01212AI-Powered Trait Analysis for Poisonous Mushroom ClassificationSrikar Akundi
516:48–17:00S01211Machine Learning-Based Classification of Transcriptional Gene Groups for Cancer PrognosisAnnie Wu
617:00–17:12S01210Performance Evaluation of Convolutional Neural Networks in Image-Based Malware ClassificationRaymond Jiang
UH12 (Room: Massachusetts, Session Chair: Meikang Qiu): S01208, S01207, S01206, S01204, S01203, S01201
#TimePaper IDTitleContact
116:00–16:12S01208Signature vs. Substance: Evaluating the Balance of Adversarial Resistance and Linguistic Quality in Watermarking Large Language ModelsWilliam Guo
216:12–16:24S01207Hyperspectral Band Selection with Learnable Weights for Efficient Glioblastoma DetectionAlbert Li
316:24–16:36S01206Dimension Reduction Enhanced Boosting for Imbalanced Data ClassificationEric Wang
416:36–16:48S01204Probabilistic Prompts for Zero-shot and Few-shot Large Language Models: An Empirical Study of Patient-reported OutcomesZhong Chen
516:48–17:00S01203Quantitative Assessment on the Impact of Music on Athletic PerformanceAngela Du
617:00–17:12S01201Predicting Relationship Stability Using Communication PatternsDeepak Gahalot

Parent Panel — Panelists

← Back to agenda

This panel appears on Nov 15, 11:30–12:25 (see agenda above).

Parent Panelists
Name Email Affiliation Title / Role
Min Wu minwu@umd.edu University of Maryland – College Park Distinguished University Professor; Associate Dean for Graduate Programs; Christine Yurie Kim Eminent Professor in Information Technology, Electrical and Computer Engineering
Ambreen Hasan ambihasan@gmail.com Youngstown State University Director of Institutional Research and Analytics
Jianjun Xie jianjunxie@gmail.com FICO Principal Scientist
Dhuruva Badri drubadri@gmail.com Florida Department of Transportation Assistant District Construction Engineer
Tarun Thakur tarun.thakur@alumni.duke.edu Veza Co-Founder and CEO

Poster Presentation

  1. The recommended poster size is A0 (33.1″ × 46.8″ or 841 mm × 1189 mm). Posters may be prepared in either portrait or landscape orientation. Authors are free to design their posters as they see fit, as long as the poster fits within the allotted display area.

    Each poster board measures 8 feet wide by 4 feet high and is double-sided. Two posters will be displayed on each side, so each presenter has a maximum space of 4 feet × 4 feet (1.22 m × 1.22 m), including margins. For visual reference, please see the attached diagram illustrating the poster board layout.

    8’ board width
    Illustration: one 8’ × 4’ board (one side) with two 4’ × 4’ poster areas.
  2. Presentation materials may be attached using pushpins or tape, which will be provided on site.


Scope of Topics

We invite submissions of original research papers from undergraduate and high school students on topics related to data mining, including but not limited to:

Topics of Interest
  1. Foundations, algorithms, models, and theory of data mining, including big data mining
  2. Machine learning, deep learning, and statistical methods for big data
  3. Mining heterogeneous data sources, including text, semi-structured, spatio-temporal, streaming, graph, web, and multimedia data
  4. Data mining systems and platforms for analyzing big data, including methods for parallel and distributed data mining, federated learning, and their efficiency, scalability, security, and privacy
  5. Data mining for modeling, visualization, personalization, and recommendation
  6. Data mining for cyber-physical systems and complex, time-evolving networks
  7. Data mining with large language models
  8. Novel applications of data mining in data science, including big data analysis in social sciences, physical sciences, engineering, life sciences, climate science, web, marketing, finance, precision medicine, health informatics, and other domains

Submitted papers should present novel ideas, methodologies, algorithms, or applications in the realm of data mining. Papers will be evaluated based on their technical quality, novelty, relevance, and clarity of presentation.


Eligibility


In-Person Policy (UGHS)

The UGHS Symposium will be run as an in-person session. At least one student author must attend and present the paper in person; otherwise, the paper will not be included in the IEEE proceedings.

Additional requirements for high school student authors:

  1. At least one parent accompanies the student to the conference.
  2. A signed release form (download here).
  3. An information document signed by a parent/guardian, along with an optional signature from the high school principal. The document should include the paper ID and title; contact information for the parent (required) and the principal (optional); a statement of permission from the high school (optional); and confirmation that a parent will accompany the student.
    Template: Information Document (DOCX)
  4. Note: School approvals are optional. However, some schools require their permission.

    The conference policy requires that each student participant be accompanied by their own parent or legal guardian. One adult chaperone cannot supervise multiple students.

Where to send: Please email the signed release form and the optional information document to ieee-icdm-2025-undergraduate-and-high-school-symposium-g@vt.edu before November 7, 2025.


Submission Format Requirements


Publication Ethics and Dual Submission Policy

We uphold the highest standards of academic integrity and ethical research conduct. All authors must ensure that their submissions fully comply with the following guidelines.

Originality and Dual Submission: Submitted manuscripts must represent original work that has not been submitted or published elsewhere. Dual submission—submitting the same or substantially similar content to multiple venues concurrently— is strictly prohibited and constitutes a violation of publication ethics and integrity.

Subsequent Journal Submission: If authors wish to submit an extended version of their published work to a journal after conference publication (e.g., IEEE ICDM proceedings), they must clearly disclose that a preliminary version has been published. Transparency with journal editors and reviewers is required. Any subsequent submission to another venue must include significant new contributions, such as expanded experiments, novel theoretical insights, or additional methodological developments. Minor revisions or incremental changes are not sufficient to qualify as a new work.

Ethical Responsibility: Authors are responsible for maintaining ethical standards in all aspects of the publication process, including authorship, data integrity, and proper citation of prior work. Violations may lead to retraction, notification of the authors’ institutions, and disqualification from future submissions.


Important Dates


Venue & Hotels

IEEE ICDM 2025 will be held in-person in Washington, D.C. Discounted hotel rooms are available. Please check: https://www3.cs.stonybrook.edu/~icdm2025/venue.html .


How to Submit

  • The review process is single-blind (reviewers anonymous; authors visible to reviewers).
  • Accepted papers will appear in the ICDM Workshop Proceedings (IEEE Computer Society Press) and be available at the conference.
  • Each accepted paper must have at least one author registered and present in person during the symposium day.

Camera-Ready Submission

Final deadline: October 15, 2025.

  • The camera-ready paper must follow all formatting requirements (not a response to reviewer comments). Ensure IEEE formatting/page limits per the ICDM 2025 website, use high-clarity figures, and keep the layout clean and consistent.
  • Submission site: IEEE CPS Login
  • PDF eXpress validation: Conference Record 69685.
  • Paper ID on the registration form: use your conference submission ID (e.g., S01217) from the submission emails — not the IEEE PDF eXpress ID (e.g., 2025357692).
  • Issues with copyright or submission? Contact Martha Nunez.

Registration

We have already provided the discount registration code in the camera-ready email to all authors. That discounted registration fee is the one-day symposium fee, rather than the full conference fee. It covers one student and one parent companion, and also includes the Friday banquet.


Program Co-Chairs

Join us in shaping the future of data mining research by sharing your insights and discoveries at the Undergraduate and High School Symposium. For questions, please contact the symposium co-chairs.

For general inquiries: ieee-icdm-2025-undergraduate-and-high-school-symposium-g@vt.edu

Session Co-Chairs

Portrait of Fang Jin
Fang Jin
Associate Professor
The George Washington University
Portrait of Sanjay Madria
Sanjay Madria
Curators’ Distinguished Professor
Computer Science Department, Missouri University of Science and Technology
Portrait of Haibing Lu
Haibing Lu
Department Chair and Professor
Information Systems & Analytics,Santa Clara University
Portrait of Mohammad Ali Javidian
Mohammad Ali Javidian
Assistant Professor
Department of Computer Science, Appalachian State University
Portrait of Long Nguyen
Long Nguyen
Assistant Professor, Computer Science & Engineering
Speed School of Engineering, University of Louisville
Portrait of Kanthi K Sarpatwar
Kanthi K Sarpatwar
Research Staff Member
IBM TJ Watson Research, New York
Portrait of Zhou Yang
Zhou Yang
Ph.D.; Department of Statistics
Columbian College of Arts and Sciences, The George Washington University
Portrait of Meikang Qiu
Meikang Qiu
Professor (Volunteer Adjunct); ACM Distinguished Member; IEEE Senior Member
Department of Computer Science, Pace University
Portrait of Yi He
Yi He
Assistant Professor
School of Computing, Data Sciences, and Physics, William & Mary
Portrait of Senjuti Basu Roy
Senjuti Basu Roy
Panasonic Chair in Sustainability; Associate Professor
Department of Computer Science, New Jersey Institute of Technology

Web Chairs

Haoyue Bai
Graduate Research Associate
School of Computing and Augmented Intelligence (SCAI), Arizona State University
Cong Wei
SCAI Grader — Fall 2025
School of Computing and Augmented Intelligence (SCAI), Arizona State University

Student Volunteers

Arun Vignesh Malarkkan
Ph.D. Candidate, Computer Science
Arizona State University
Victoria Lu
Student Volunteer
University of Virginia

Visa Information Contact

Back to top ↑