| Design and Development of Medical Databases |
Tae Hoon Kong1,2,3,4
, Young Joon Seo1,2,3,4
|
1Department of Otorhinolaryngology-Head and Neck Surgery, Yonsei University Wonju College of Medicine, Wonju, Korea 2Department of Medical Informatics and Biostatistics, Yonsei University Wonju College of Medicine, Wonju, Korea 3Department of Bio-AI Convergence, Yonsei University Wonju College of Medicine, Wonju, Korea 4Research Institute of Hearing Enhancement, Yonsei University Wonju College of Medicine, Wonju, Korea |
| 의학 데이터베이스의 설계와 구축 |
공태훈1,2,3,4
, 서영준1,2,3,4
|
1연세대학교 원주의과대학 이비인후과학교실 2연세대학교 원주의과대학 의료정보통계학과 3연세대학교 원주의과대학 바이오-AI 융합학과 4연세대학교 원주의과대학 청각재활연구소 |
|
|
|
Received: 26 February 2025; Revised: December 10, 2025 Accepted: 15 December 2025. Published online: 13 April 2026. |
|
|
|
| ABSTRACT |
|
Since the Fourth Industrial Revolution was declared at the World Economic Forum in 2016, we have witnessed significant technological innovations across various fields. Artificial intelligence (AI) technology has developed alongside computing technology, algorithms, and platforms, but its most crucial foundation lies in data collection and management capabilities. This paper aims to share expertise gained through our experience in constructing medical databases, including the Hearing Big Data Center, Common Data Model implementation, and balance function test databases. We first explain the fundamental concepts of databases, distinguishing between structured and unstructured data, and introduce relational database models and data schemas particularly relevant to otolaryngology. We then describe practical strategies for designing and operating medical databases that can flexibly handle both structured and unstructured data, using hearing and vestibular function test data as representative examples. Additionally, we address important considerations for healthcare database construction, including privacy regulations, de-identification processes, and data combination techniques according to South Korean guidelines. Finally, we discuss how well-designed medical databases can be linked with AI and machine learning models, enabling multimodal analysis that integrates clinical data, imaging, and physiological signals. We argue that by understanding database design principles and proper data management approaches, medical researchers can more efficiently utilize vast medical data resources while maintaining patient privacy, ultimately advancing medical research and clinical practice in the era of AI. |
| Keywords:
Artificial intelligenceㆍ Databasesㆍ Data collectionㆍ De-identificationㆍ Otolaryngology |
|