HOW IT STARTED
The impetus for CAAI stemmed from past projects which had money and data but lacked the expertise to bridge technical and domain knowledge. Previous initiatives, such as the Institute for Biomedical Informatics’ Innovation Core and the AI in Medicine Alliance, revealed that giving researchers funding to explore AI use without a technical partner often resulted in stalled projects.
CAAI was built to fill that need. Housed within the university’s Institute for Biomedical Informatics, the center provides infrastructure, reusable code and technical staff that allow collaborators to move from concept to prototype quickly and trust data is housed securely.
Bumgardner said collaborations with the center can begin informally, with researchers hearing about CAAI through word of mouth, or publications on work the center has contributed to. Then, researchers can meet with CAAI team members to translate their individual problems into a technical plan, often involving a self-service tool that allows them to experiment on their own time. Having adaptable self-service tools helps researchers get started more quickly, Bumgardner said, so they are not waiting on bureaucratic processes or individuals to create one-off tools.
THE CENTER’S SELF-SERVICE TOOLS
CAAI’s reusable platforms remove repetitive technical work from individual projects, and some are simpler than others.
For example, CAT-Talk is an in-house speech-to-text transcription service.
Another tool, LLM Factory, provides a locally hosted platform for users to fine-tune large language models to their own needs, also known as creating custom GPTs. It uses OpenAI training models as a base, and data uploads from individual users for customization, without using those data uploads to train OpenAI’s general models on proprietary research.
Other tools tackle more complicated tasks. CLASSify, the tool Silverstein used, is a web-based tool that lets users train and compare machine learning classifiers on any data that comes in a table. Using the AI, Silverstein said, can help validate variable choices and avoid purely subjective selection.
CLASSify helps with interpreting data collected, not necessarily by forming written conclusions, but by determining which of many variables changed the most, as in Silverstein’s case. If a researcher is working with sensitive health data, it can generate synthetic example data that follows the same patterns, so researchers can share key findings without compromising individual information.
“The most surprising thing was how easy it was to actually use,” Silverstein said. “I do not come from a background with really any programming experience at all.”
SmartState is a platform that automates research protocol compliance processes. Making use of agentic AI, virtual agents guide study participants through protocols and send messages to reduce the need for manual reminders and follow-up. It also documents study progress.
THE CENTER’S IMPACT
CAAI’s projects span surgery, neonatal health, cardiology, public health and more. In neonatal care, CAAI supported machine learning models analyzing data from ventilators and oxygen monitors to predict when pre-term babies are ready to have breathing tubes removed. That helped doctors make more informed decisions, according to an executive report from CAAI. Another project through the center uses ambulance and emergency data to forecast opioid-related incidents across the state, helping inform response.
Operational uptake like this is one way the center measures success, Bumgardner said, along with helping its people grow and continuing to advance research.
“What can we do to actually directly apply to help folks?” he said.