Hi! I am Xiao Luo, a PhD student in Computer Science and Engineering at The Ohio State University. My research focuses on building efficient, accurate, and scalable vector database systems for large-scale similarity search.

More specifically, I am interested in the following research directions:

  • Efficient Similarity Evaluation
    Designing high-performance distance evaluation primitives by leveraging modern CPU vectorization techniques (e.g., AVX2, AVX-512), with a focus on accelerating small-table lookups and quantized distance computations such as Product Quantization (PQ) and Scalar Quantization (SQ).

  • Efficient Approximate Nearest Neighbor Search
    Developing approximate nearest neighbor search systems that tightly integrate graph-based indexing structures with quantization techniques, enabling fast and accurate retrieval.

  • Generalization and Robustness
    Building robust and scalable vector database systems that generalize well across diverse datasets and application scenarios, ensuring stable performance without extensive per-dataset tuning.

Education

OSU logo
PhD of Computer Science and Engineering
The Ohio State University
2024 – Present
GT logo
Master of Electrical and Computer Engineering
Georgia Institute of Technology, Atlanta, USA
2022 - 2024
SCU logo
Bachelor of Engineering, Software Engineering
Sichuan University, Chengdu, China
2018 - 2022

Project


ANN Indices Ensemble Analysis

(Under Revision at VLDB 2026)

  • Studied when and how to train multiple ANN indices and ensemble their candidate results to improve robustness and retrieval accuracy.
  • Analyzed the trade-offs between index size, construction cost, and search accuracy in multi-index ANN systems.
  • Demonstrated that an ensemble of multiple small indices (e.g., two HNSW indices with modest construction and search parameters) can achieve accuracy comparable to a single large index, while requiring only ~30% of the construction time.

Award

Fun Fact

I enjoy solving algorithmic problems in my spare time and write blog posts to document my thoughts and solutions.