- Enhance SOCR analysis toolbox and visualization components with an emphasis on Big Biomedical and Neuroscience Data. The toolbox will be designed to run in a web browser and enhance the visual presentation and interpretation of Big Data. The creation of the toolbox will allow many more researchers (including students) to learn about, appreciate, contribute, and apply complex analytics to their work, making Big Data much easier to turn into “impactful results” and actionable “decision making.”
- Implement powerful, modern, and portable webapps (HTML5/JavaScript/Rshiny/Rmarkdown/Jupyter) that can be used to model various interesting processes, enable exploratory and quantitative data analyses, and facilitate the understanding of high-dimensional and complex information.
- Develop advanced AI/ML data analytics, e.g., generative foundational AI models and applications, compressive big data analytics, statistical obfuscation techniques, and Bayesian approaches to address specific biomedical, healthcare, neuroimaging-genetics, and other applications.
- Expand the novel Spacekime Analytics method for mathematical representation, statistical inference, and computational prediction of large longitudinal information.
More details are provided on the SOCR Research website (https://socr.umich.edu/html/SOCR_Research.html).
Team Organization
Each SOCR sub-team is coached by an experienced student that reports to the SOCR faculty and the PI. Sub-teams are mostly focused around developing the mathematical foundations, building particular algorithms, and designing statistical approaches for addressing applications. The sub-teams are flexibly structured to promote creativity, provide opportunity for student growth, and nurture team-science. We have the following project sub-teams: SOCRAT, CBDA, DataSifter, Data Analytics, Data Science Fundamentals, Spacekime analytics, (see SOCR website). As students develop skills and build confidence, they should expect increasing responsibilities and assignments with multiple parts.
Below are the skills needed for this project. Students with the following relevant skills and interest in the project are encouraged to apply! Although the team consists of subteams, students apply to the project as a whole, rather than individual roles on the team.
Programming (3 Students)
Preferred skills: HTML5, JavaScript, Web-based functional development, Intuitive UI/UX design, Experience with Adobe Illustrator, Canvas, and/or R/Python a plus
Likely majors: CS, SI, DATA, ANY
Analytics (3 Students)
Preferred skills: Amazon AWS Elastic Computing, Statistical modeling, high-throughput data analytics, machine learning, R/Python
Likely majors: STATS, DATA, SI, MIDAS, BIOSTAT, BIOINF, MATH, CS
Methods (DataSifter & CBDA) (4 Students)
Preferred skills: Technical math background, AI/ML, R-computing
Likely majors: STATS, DATA, SI, MIDAS, BIOSTAT, BIOINF, MATH, CS
Data Science Fundamentals (3 Students)
Preferred Skills: Students with strong mathematics and physics background and significant computational R-programming skills. Strong motivation and interests in graduate-level fundamentals of data science principles are necessary. Trainees will work directly with the PI. Students should be familiar with information measures, entropy KL divergence, ODEs/PDEs, Dirac’s bra-ket operators. Review the spacekime.org website.
Likely majors: PHYSICS, MATH or ENG background
Apprentice Researcher (4 Students)
Requirements: Interest in project material, willingness to develop skills. Open to first- and second-year undergraduate students ONLY.
Likely Majors: CS, STATS, BIOSTAT, BIOINF, MATH, PHYSICS, ENG, DATA, SI
Faculty Sponsor
Ivo D. Dinov
Professor, Computational Medicine and Bioinformatics and Health Behavior and Biological Sciences.
Prof. Dinov is the director of the Statistics Online Computational Resource (SOCR). He develops advanced mathematical models for representation, scientific computing, artificial intelligence, deep network models, statistical analysis and interactive visualization of multi-dimensional, multimodal and informatics biomedical data (Big Data). With expertise in human brain imaging, statistical computing and high-throughput distributed data processing, Dr. Dinov approaches biomedical and health science research from the perspective of team-science Big Data applications in informatics, multimodal biomedical image analysis, distributed genomics computing, complex-time representation of longitudinal data, spacekime analytics, and health analytics.
Location: This project requires in-person participation on the Ann Arbor campus for the entire project period. Remotely based students may not participate in this project at this time.
Course Substitutions: Honors, CS-ENG/DS-ENG/EE/CE-ENGR 355 and higher can count toward Flex Tech.
These substitutions/departmental courses are available for students in these respective majors. MDP does not yet have a formal agreement with other departments for substitutions/departmental courses not listed. Please reach out to your home department’s academic advisor about how you might apply MDP credits to your degree plan.
Citizenship Requirements: This project is open to all students on campus
IP/NDA: Students who successfully match to this project team will be required to sign an Intellectual Property (IP) Agreement prior to participation in January 2025.
Summer Opportunity: Summer research fellowships may be available for qualifying students
More information is available at https://socr.umich.edu/docs/uploads/2025/SOCR_MDP_2025_Projects.pdf
- SOCR Lab: https://socr.umich.edu/
- SOCR News: https://wiki.socr.umich.edu/index.php/SOCR_News
Learn more about the expectations for this type of MDP project