News

Undergraduate Student Disproves 40-Year-Old Data Structure Conjecture

Published

5 months ago

March 16, 2025

Undergraduate Student Computer Science Breakthrough

CAMBRIDGE, England — In a remarkable turn of events in the field of computer science, Andrew Krapivin, a graduate student at the University of Cambridge, has made a groundbreaking discovery that disrupts a 40-year-old conjecture related to hash tables.

Krapivin’s journey began as an undergraduate at Rutgers University in fall 2021 when he came across a paper discussing ‘tiny pointers,’ which are entities that can direct users to specific information or elements within a computer’s memory. After setting aside time to revisit the paper two years later, Krapivin explored ways to minimize the memory usage of these pointers. He eventually stumbled upon a novel approach to organizing data that could not only create more efficient pointers but also led him to invent a still faster type of hash table.

Initially skeptical of Krapivin’s claims, his former professor, Michael Farach-Colton of New York University, enlisted the help of a colleague, Scott Kuszmaul of Carnegie Mellon University, to evaluate Krapivin’s work. Contrary to Farach-Colton’s apprehensions, Kuszmaul found something extraordinary: “You’ve actually completely wiped out a 40-year-old conjecture!”

In their comprehensive research, Krapivin, Farach-Colton, and Kuszmaul proved that this new hash table effectively allows searches and insertions to be completed faster than what was previously deemed possible, contradicting a long-held belief in computer science. The conjecture in question, posed by computer scientist Andrew Yao in 1985, stated that the worst-case time for queries and insertions in hash tables could never be better than a proportionate value known as ‘x.’

“I did this without knowing about Yao’s conjecture,” Krapivin remarked, emphasizing that his lack of prior knowledge of the conjecture uniquely positioned him to innovate without bias. The team demonstrated that their new hash table operates in a worst-case time complexity of (log x)², which is significantly faster than the previously accepted bound of x.

This groundbreaking achievement is underscored by its implications for how hash tables function and their various applications in data storage. “This result is beautiful in that it addresses and solves such a classic problem,” commented A. L. H. K. of Carnegie Mellon University. “It’s not just that they disproved [Yao’s conjecture], they also found the best possible answer to his question,” added Andrew Wright from the University of Waterloo.

The research also hints at a new horizon. Krapivin and his colleagues explored the average time taken for queries across all possible scenarios, disproving another earlier limitation by presenting a non-greedy hash table that demonstrated a constant average query time. “You get a number that is just a constant and doesn’t depend on how full the hash table is,” Farach-Colton elaborated.

While the immediate applications of this research remain speculative, experts agree on its importance. “It’s significant to understand these kinds of data structures better. You don’t know when a result like this will unlock something that lets you do better in practice,” said historian and mathematical expert David Conway.

As computer scientists continue to unpack the ramifications of Krapivin’s findings, this research stands as a testament to the power of curiosity and innovation in the academic world, illustrating how even a fresh perspective can lead to monumental advances in understanding fundamental computing principles.