Innovative Data Technologies (IDT) Laboratory


Making scientific data management efficient and easy.

Research Projects

Multiple funded positions opened!


If interested in performing cutting-edge research and development in these areas, please send your CV and transcript to Prof. Byna by email.

Research


In the Innovative Data Technologies (IDT) lab, we conduct research in all aspects of data management for science, including storage and I/O, file systems, metadata management, data quality assessment and improvement, performance analysis, performance tuning, data security, and energy-efficiency. Our emphasis is on developing systems and tools that make managing scientific data efficient and easy for scientists using high-performance computing (HPC), cloud, and edge computing systems.

Novel Data Systems

Brief explanation regarding the research area

PDC

Data Readiness and Quality

Brief explanation regarding the research area

AIDRIN APPFL

I/O Characterization Tuning

Brief explanation regarding the research area

DRISHTI

Data Security

Securing data, formats, and libraries

S2-D2

Declarative Analytics at Scale

Optimizing data movement in declarative analytics

Datalog

Data Querying

Querying for science data

AI Query

Publications


2024

  • Runzhou Han, Mai Zheng, Suren Byna, Houjun Tang, Bin Dong, Dong Dai, Yong Chen, Dongkyun Kim, Joseph Hassoun, David Thorsley, and Matthew Wolf, "PROV-IO+: A Cross-Platform Provenance Framework for Scientific Data on HPC Systems", IEEE Transactions on Parallel and Distributed Systems (TPDS), 2024.
  • Jean Luca Bez, Houjun Tang, Scot Breitenfeld, Huihuo Zheng, Wei-keng Liao, Kaiyuan Hou, Zanhua Huang, and Suren Byna, "h5bench: Exploring HDF5 Access Patterns Performance in Pre-Exascale Platforms", Concurrency and Computation: Practice and Experience (CCPE), 2024.
  • Jean Luca Bez, Hammad Ather, Yankun Xia, and Suren Byna "Drilling Down I/O Bottlenecks with Cross-layer I/O Profile Exploration", IPDPS 2024.
  • Neeraj Rajesh, Keith Bateman, Suren Byna, Jean Luca Bez, Anthony Kougkas, and Xian-He Sun, "TunIO: An AI-powered Framework for Optimizing HPC I/O", IPDPS 2024.
  • Dong Kyu Sung, Yongseok Son, Alex Sim, John Wu, Suren Byna, Houjun Tang, Hyeonsang Eom, and Sunggon Kim, "A2FL: Autonomous and Adaptive File Layout in HPC through Real-time Access Pattern", IPDPS 2024.
  • Wei Zhang, Houjun Tang, and Suren Byna, "IDIOMS: Index-powered Distributed Object-centric Metadata Search for Scientific Data Management", CCGrid 2024.
  • Bin Dong, John Wu, and Suren Byna, "The Art of Sparsity: Mastering High-Dimensional Tensor Storage", ESSA 2024 in conjunction with IPDPS 2024.
Learn more

People


Faculty

Suren Byna

Suren Byna

Professor

Students

Kaveen Hiniduma

Kaveen Hiniduma

Ph.D Student

Git
Hyunju Oh

Hyunju Oh

Ph.D Student

Git
Suben Kumar Saha

Suben Kumar Saha

Ph.D Student

Git
Arta Salimiparsa

Arta Salimiparsa

Ph.D Student

Aparajit Talukdar

Aparajit Talukdar

Ph.D Student

Git

Collaborators

Jean Luca Bez

Jean Luca Bez

Research Scientist, LBNL

Houjun Tang

Houjun Tang

Research Scientist, LBNL

Wei Zhang

Wei Zhang

Researcher, LBNL

IDT Lab Information


Address

Dreese Laboratory, 2015 Neil Ave, Columbus, OH 43210

Contact Information

byna.1@osu.edu