On the Privacy of Sublinear-Communication Jaccard Index Estimation via Min-hash Sketching

직함: Professor

소속: United States Naval Academy
주최: 박근수 교수
날짜: 2024/6/20 오전 11:00 - 오후 12:00
위치: 302동 209호
요약

The min-hash sketch is a well-known technique for low-communication approximation of the Jaccard index between two input sets.  Moreover, there is a folklore belief that min-hash sketch based protocols protect the privacy of the inputs.  In this paper, we investigate this folklore to quantify the privacy of the min-hash sketch.

We begin our investigation by considering the privacy of min-hash in a centralized setting where the hash functions are chosen by the min-hash functionality and are unknown to the participants.  We show that in this case the min-hash output satisfies the standard definition of differential privacy (DP) without any additional noise.  This immediately yields a privacy-preserving sublinear-communication semi-honest 2-PC protocol based on FHE where the hash function is evaluated homomorphically.

To improve the efficiency of this protocol, we next consider an implementation in the random oracle model. Here, the protocol participants jointly sample public prefixes for domain separation of the random oracle, and locally evaluate the resulting hash functions on their input sets.  Unfortunately, we show that in this public hash function setting, the min-hash output is no longer DP.  We therefore consider the notion of distributional differential privacy (DDP) introduced by Bassily et al. (FOCS 2013). We show that if the honest party's set has sufficiently high min-entropy then the output of the min-hash functionality achieves DDP, again without any added noise. This yields a more efficient semi-honest two-party protocol in the random oracle model, where parties first locally hash their input sets and then perform a 2PC for comparison.    

By proving that our protocols satisfy DP and DDP respectively, our results formally confirm and qualify the folklore belief that min-hash based protocols protect the privacy of their inputs.

연사 소개
  • Professor at United States Naval Academy (2013~)
  • Postdoctoral Researcher at University of Maryland and Columbia University
  • Ph. D from Columbia University
  • M.S and B.S from Seoul National University
  • Research Interest: Development of privacy-enhancing mechanisms, Foundational questions in cryptography such as general feasibility results and relationships between cryptographic primitives