Featured · First-author research
InsightSort
The system behind “Sort it Like You Mean It” (PVLDB Vol. 18, No. 12, 2025). InsightSort is a GPT-4o-powered semantic analytics engine that maps 5–7 context-aware ranking dimensions onto each target dataset and enriches its records with attributes missing from the host table's schema. A dual 384-dimensional HNSW vector-index layer over the data lake replaces O(N) column scans with sub-linear top-5 approximate-nearest-neighbor (ANN) lookups, isolating valid join paths in milliseconds and emitting optimized DuckDB SQL patches.
- Python
- GPT-4o
- HNSW
- Embeddings
- DuckDB
- React.js
- Flask
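
To illustrate the join-discovery idea described above, here is a minimal sketch and not the InsightSort implementation. It ranks candidate columns by exact cosine similarity, which is the O(N) baseline that an HNSW index approximates in sub-linear time, and then formats a join statement for the best match. The function names, the toy 3-dimensional vectors (standing in for 384-dimensional embeddings), and the table and column names are all hypothetical.

```python
import heapq
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k_columns(query_vec, column_vecs, k=5):
    """Exact top-k search over column embeddings.

    This scans every candidate; an HNSW index would return an
    approximate answer without visiting all of them.
    """
    scored = ((cosine(query_vec, vec), name) for name, vec in column_vecs.items())
    return [name for _, name in heapq.nlargest(k, scored)]


def left_join_sql(host, host_col, lake_table, lake_col):
    """Format a LEFT JOIN that attaches lake columns to the host table."""
    return (
        f'SELECT h.*, l.* FROM "{host}" AS h '
        f'LEFT JOIN "{lake_table}" AS l '
        f'ON h."{host_col}" = l."{lake_col}"'
    )


# Hypothetical column embeddings, keyed as "table.column".
columns = {
    "movies.title": [0.9, 0.1, 0.0],
    "movies.year": [0.0, 1.0, 0.0],
    "reviews.film_name": [0.8, 0.2, 0.1],
    "reviews.stars": [0.0, 0.1, 0.9],
}
query = [1.0, 0.0, 0.0]  # hypothetical embedding of the host join key

matches = top_k_columns(query, columns, k=2)
table, column = matches[1].split(".")
sql = left_join_sql("movies", "title", table, column)
```

The exact scan keeps the sketch dependency-free; in practice the candidate lookup would be served by an ANN index, and the generated SQL would be validated before being run.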