BoilerDB

BOILERDB SYSTEMS SEMINAR EXTERNAL-LINK

The seminar is aimed at fostering collaboration and knowledge sharing among graduate students and researchers. The seminar will serve as a platform for students to present their ongoing work, exchange feedback, and engage in interdisciplinary discussions. While the core focus will be on database research, we aim to include students from related fields such as computer architecture and systems to encourage cross-domain insights and potential collaborations.
Also, checkout the parent seminar The Database Systems Seminar organized by Prof. Walid Aref.
Time: Wednesdays
Location: Hybrid
We hold monthly talks on Wednesdays.

Upcoming Talks

Summer 2025

1st Edition

The 1st Edition of the seminar will feature talks by

Shige Liu (Purdue CS)
Libin Zhou (Purdue CS)
Yeasir Rayhan (Purdue CS)
Abdullah Al-Mamun (Purdue CS)
Lu Xing (Purdue CS)
June 4, 2025 11:00 AM - noon (Eastern Time)

TigerVector: Supporting Vector Search in Graph Databases for Advanced RAGs

Shige Liu (Purdue CS) EXTERNAL-LINK

In this paper, we introduce TigerVector, a system that integrates vector search and graph query within TigerGraph, a Massively Parallel Processing (MPP) native graph database. [...]

Shige Liu a third-year Ph.D. student in the Department of Computer Science at Purdue University, advised by Prof. Jianguo Wang. His research focuses on Vector Databases, Graph Databases, and Retrieval-Augmented Generation (RAG). He is passionate about building efficient and practical database systems to power next-generation applications. [...]

TBD

GTX: A Write-Optimized Latch-free Graph Data System with Transactional Support

Libin Zhou (Purdue CS) EXTERNAL-LINK

This paper introduces GTX, a standalone main-memory write-optimized graph data system that specializes in structural and graph property updates while enabling concurrent reads and graph analytics through ACID transactions. [...]

Libin Zhou is a PhD student at Purdue CS [...]

TBD

Exploring Next Token Prediction For Optimizing Databases

Yeasir Rayhan (Purdue CS) EXTERNAL-LINK

The Next Token Prediction paradigm (NTP, for short) lies at the forefront of modern large foundational models that are pre-trained on diverse and large datasets. These models generalize effectively, and have proven to be very successful in Natural Language Processing (NLP). Inspired by the generalization capabilities of Large Language Models (LLMs), we investigate whether the same NTP paradigm can be applied to DBMS design and optimization tasks. [...]

Yeasir Rayhan a 4th year PhD student in Computer Science at Purdue University, advised by Professor Walid G. Aref. [...]

TBD

Learned Indexes From the One-dimensional to the Multi-dimensional Spaces: Challenges, Techniques, and Opportunities

Abdullah Al-Mamun (Purdue CS)EXTERNAL-LINK

[...]

Abdullah Al-Mamun is a PhD candidate at Purdue CS. [...]

TBD

An Adaptive Hotspot-Aware Index for Oscillating Write-Heavy and Read-Heavy Workloads

Lu Xing (Purdue CS)EXTERNAL-LINK

HTAP systems are designed to handle transactional and analytical workloads. Besides a mixed workload at any given time, the workload can also change over time. A popular type of continuously changing workload is one that oscillates between being write-heavy at times and being read-heavy at other times. [...]

Lu Xing is a PhD candidate in Computer Science department at Purdue University. [...]