{"id":1192,"date":"2025-02-10T17:36:21","date_gmt":"2025-02-10T22:36:21","guid":{"rendered":"https:\/\/sites.bu.edu\/healthdatascience\/?page_id=1192"},"modified":"2026-06-02T11:31:57","modified_gmt":"2026-06-02T15:31:57","slug":"news","status":"publish","type":"page","link":"https:\/\/sites.bu.edu\/healthdatascience\/news\/","title":{"rendered":""},"content":{"rendered":"<h2><strong>The Center for Health Data Science Launches DataHub to Advance AI-Driven and Convergent Research<\/strong><\/h2>\n<h4><strong>New initiative targets the growing scale and complexity of health data, removing systemic barriers to data discovery and cross-disciplinary collaboration<\/strong><\/h4>\n<h4>By Maureen Stanton<\/h4>\n<hr \/>\n<p><img loading=\"lazy\" src=\"\/healthdatascience\/files\/2026\/06\/CHDS-News-Hariri-Graphic-1.jpg\" alt=\"\" width=\"493\" height=\"277\" class=\"wp-image-2140 alignright\" srcset=\"https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/CHDS-News-Hariri-Graphic-1.jpg 1600w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/CHDS-News-Hariri-Graphic-1-636x358.jpg 636w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/CHDS-News-Hariri-Graphic-1-1024x576.jpg 1024w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/CHDS-News-Hariri-Graphic-1-768x432.jpg 768w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/CHDS-News-Hariri-Graphic-1-1536x864.jpg 1536w\" sizes=\"(max-width: 493px) 100vw, 493px\" \/><\/p>\n<p>The health sector is generating unprecedented volumes of digital data \u2014 from clinical records to genomics and populationhealth. But more data has not meant easier access for researchers. Fragmented systems, regulatory complexity, and technical barriers continue to slow progress.<\/p>\n<p>To address these challenges,<span>\u00a0<\/span><span><a href=\"https:\/\/sites.bu.edu\/healthdatascience\/\" target=\"_blank\" rel=\"noopener\">Boston University&#8217;s Center for Health Data Science (CHDS)<\/a><\/span><span>\u00a0<\/span>has launched<span>\u00a0<\/span><a href=\"https:\/\/sites.bu.edu\/healthdatascience\/chds-datahub\/\" target=\"_blank\" rel=\"noopener\">DataHub<\/a>, a new initiative designed to transform how researchers discover, access, and work with health data.<\/p>\n<p>DataHub acts as both a gateway and a guide. It connects researchers to high-value datasets, leveraging the center\u2019s rich data inventory and governance expertise to facilitate responsible and effective access and use. It also enables AI-enabled research by helping ensure that underlying data are curated, organized in ways that allow them to work across systems and disciplines, and traceable.<\/p>\n<p>\u201cResearchers are often working with complex data, but the systems for discovering, accessing, and connecting those data have not evolved nearly as quickly,\u201d says Debbie Cheng, Executive Director of CHDS. \u201cDataHub is designed to help lower those barriers and make it easier for researchers to work with data in rigorous, collaborative, and responsible ways.\u201d<\/p>\n<p>That foundation includes detailed metadata, standardized curation practices, and clear documentation of data provenance, a verifiable record of where data comes from and how it has been managed. Together, these elements help ensure that AI models and the insights they produce are grounded in reliable, well-understood data.<\/p>\n<p><strong>Reducing Barriers to Discovery<\/strong><\/p>\n<figure id=\"attachment40\" aria-describedby=\"caption-attachment40\" style=\"width: 207px\" class=\"wp-caption alignright\"><a href=\"https:\/\/www.bu.edu\/sph\/profile\/debbie-cheng\/\" rel=\"attachment noopener wp-att-40\" target=\"_blank\"><img loading=\"lazy\" src=\"\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1.jpg\" alt=\"Headshot of Debbie Cheng\" width=\"197\" height=\"197\" class=\"wp-image-40\" srcset=\"https:\/\/sites.bu.edu\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1.jpg 1022w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1-636x636.jpg 636w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1-150x150.jpg 150w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1-768x770.jpg 768w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1-700x700.jpg 700w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1-189x189.jpg 189w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2023\/07\/Debbie-Cheng-2-1022x1024-1-100x100.jpg 100w\" sizes=\"(max-width: 197px) 100vw, 197px\" \/><\/a><figcaption id=\"caption-attachment40\" class=\"wp-caption-text\"><span style=\"color: #808080;\">Debbie Cheng, Professor of Biostatistics, Assistant Dean of Data Science, and founding Executive Director of the Center for Health Data Science at the School of Public Health.<\/span><\/figcaption><\/figure>\n<p>While DataHub can support emerging AI and data science applications, its impact extends across health research more broadly. \u201cResearchers often spend enormous amounts of time trying to identify, access, and understand complex datasets,\u201d says Cheng. \u201cDataHub is designed to make that process easier, more transparent, and more reliable so researchers can focus on the science.\u201d<\/p>\n<p>By combining a curated catalog of impactful datasets and resources, with built-in guidance on data use agreements, regulatory requirements, and ethical considerations, DataHub reduces uncertainty and shortens the time needed to begin new projects.<\/p>\n<p>DataHub draws on the expertise of <span><a href=\"https:\/\/sites.bu.edu\/healthdatascience\/bedac\/\" target=\"_blank\" rel=\"noopener\">Boston University\u2019s Biostatistics and Epidemiology Data Analytics Center (BEDAC),<\/a><\/span> which has supported academic, government, and industry research since 1984. Over four decades BEDAC has developed deep expertise and capabilities in data management, statistical analysis, and secure computing for thousands of studies. DataHub translates institutional knowledge into an accessible, scalable infrastructure that reduces the technical and administrative burdens that slow research and expands access to BEDAC\u2019s rigorous, hands-on support.<\/p>\n<p><strong>Enabling Convergence Across Disciplines<\/strong><\/p>\n<p>DataHub also supports a broader push toward convergence research at Boston University, bringing together disciplines such as medicine, public health, engineering, and data science to tackle complex health challenges. This work depends on connecting data and methods across systems shaped by different standards, technologies, and regulatory constraints. By lowering these barriers, DataHub makes it easier to study how multiple factors intersect, from neighborhood conditions and environmental exposures to long-term health outcomes and healthcare use.<\/p>\n<p>Privacy, security, and compliance are core to DataHub. For supported environments such as the All of Us Research Program, appropriate access controls and regulatory pathways are already in place. For other data sources, DataHub helps researchers understand and navigate the requirements for access and responsible use. DataHub also assists teams with preparing IRB and data access documentation, advises on technical controls and de-identification strategies, and connects investigators with institutional resources that support secure and responsible data use.<\/p>\n<figure id=\"attachment2151\" aria-describedby=\"caption-attachment2151\" style=\"width: 355px\" class=\"wp-caption alignleft\"><img loading=\"lazy\" src=\"\/healthdatascience\/files\/2026\/06\/DataHub-graphic-e1780338543871-636x349.png\" alt=\"\" width=\"345\" height=\"189\" class=\"wp-image-2151\" srcset=\"https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/DataHub-graphic-e1780338543871-636x349.png 636w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/DataHub-graphic-e1780338543871-1024x562.png 1024w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/DataHub-graphic-e1780338543871-768x421.png 768w, https:\/\/sites.bu.edu\/healthdatascience\/files\/2026\/06\/DataHub-graphic-e1780338543871.png 1493w\" sizes=\"(max-width: 345px) 100vw, 345px\" \/><figcaption id=\"caption-attachment2151\" class=\"wp-caption-text\"><span style=\"color: #808080;\">Snapshot of centralized resources available to collaborators through DataHub.<\/span><\/figcaption><\/figure>\n<p>&#8220;Many of today\u2019s most important health challenges require researchers to work across disciplines and data systems that have historically remained separate,\u201d says Yannis Paschalidis, Director of <span><a href=\"https:\/\/www.bu.edu\/hic\/\" target=\"_blank\" rel=\"noopener\">Boston University&#8217;s Hariri Institute for Computing<\/a><\/span>. \u201cDataHub can connect data, expertise, and researchers in ways that support more collaborative and convergent research.\u201d<\/p>\n<p>For the Center for Health Data Science, the initiative reflects a broader mission. \u201cUltimately, our goal is to help researchers use complex data to answer important questions and generate insights that can improve health,\u201d says Cheng. \u201cWe hope DataHub will enable new discoveries and help translate data into meaningful public health impact.\u201d<\/p>\n<p><strong>Collaborating with DataHub<\/strong><\/p>\n<p>DataHub is expanding its efforts across Boston University to help researchers more easily discover, connect, and responsibly use complex data for convergent health research. CHDS is collaborating with developers and stewards of large, high\u2011value datasets to build a shared, well\u2011documented, and interoperable data resource ecosystem that supports cross\u2011disciplinary research. By improving data discoverability and aligning metadata standards, DataHub helps investigators more readily connect data across domains such as health, environment, and social systems. This initiative also supports NSF and NIH data management and sharing requirements by providing guidance on documentation, curation, compliance, and long\u2011term stewardship.<\/p>\n<p>Investigators interested in sharing datasets, exploring collaborations, or learning more about DataHub are encouraged to contact CHDS at <span style=\"text-decoration: underline;\">chdatascience@bu.edu<\/span>. DataHub also offers consultation, training, and collaborative support alongside its technical infrastructure.<\/p>\n<p><strong>About the Center for Health Data Science<\/strong><\/p>\n<p>The <span><a href=\"https:\/\/www.bu.edu\/sph\/\" target=\"_blank\" rel=\"noopener\">Boston University School of Public Health<\/a><\/span> established the <span><a href=\"https:\/\/www.bu.edu\/sph\/research\/centers-and-groups\/health-data-science-center\/\" target=\"_blank\" rel=\"noopener\">Center for Health Data Science (CHDS)<\/a><\/span> in 2024 to advance interdisciplinary health data science research, training, and practice to improve population health. The Center brings together longstanding interdisciplinary expertise in biostatistics, epidemiology, environmental health, data science, and related fields across Boston University. Led by <span><a href=\"https:\/\/www.bu.edu\/sph\/profile\/debbie-cheng\/\" target=\"_blank\" rel=\"noopener\">Debbie Cheng<\/a><\/span>, professor of biostatistics, CHDS supports collaborative research, education, training, and the development of data-driven approaches to address complex public health and biomedical challenges.<\/p>\n<hr \/>\n","protected":false},"excerpt":{"rendered":"<p>The Center for Health Data Science Launches DataHub to Advance AI-Driven and Convergent Research New initiative targets the growing scale and complexity of health data, removing systemic barriers to data discovery and cross-disciplinary collaboration By Maureen Stanton The health sector is generating unprecedented volumes of digital data \u2014 from clinical records to genomics and populationhealth. [&hellip;]<\/p>\n","protected":false},"author":20124,"featured_media":2168,"parent":0,"menu_order":29,"comment_status":"closed","ping_status":"closed","template":"page-templates\/no-sidebars.php","meta":[],"_links":{"self":[{"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/pages\/1192"}],"collection":[{"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/users\/20124"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/comments?post=1192"}],"version-history":[{"count":50,"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/pages\/1192\/revisions"}],"predecessor-version":[{"id":2172,"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/pages\/1192\/revisions\/2172"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/media\/2168"}],"wp:attachment":[{"href":"https:\/\/sites.bu.edu\/healthdatascience\/wp-json\/wp\/v2\/media?parent=1192"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}