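Hamming distance
Hamming space
Computer science
Nearest neighbor search
Similarity (geometry)
Hamming code
Context (archaeology)
Hamming bound
Hamming weight
Algorithm
Hamming graph
Theoretical computer science
Pattern recognition (psychology)
Artificial intelligence
Image (mathematics)
Paleontology
Decoding methods
Biology
Block code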
Authors
Vikram Singh,Chandradeep Kumar
Identifier
DOI:10.1504/ijiids.2023.128275
Abstract
In the modern context, similarity is driven by the quality features of the data objects and steered by content-preserving stimuli, such as the retrieval of relevant 'nearest neighbourhood' objects and the way similar objects are pursued. Current Hamming-space-based similarity search strategies find all data objects within a threshold Hamming distance of a user query. However, the number of computations required for Hamming distance and candidate generation has been a key concern for several years. The Hamming-space paradigm extends the range of alternatives for an optimised search experience. A novel 'counting-based' similarity search strategy is proposed, with an a priori and improved Hamming-space estimation, e.g., optimised candidate generation and verification functions. The strategy adapts to a smaller set of user query dimensions and subsequently constrains the Hamming-space computations with each data object, driven by generated statistics. The extensive evaluation asserts that the proposed counting-based approach can be combined with any pigeonhole principle-based similarity search to further improve its performance.
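For readers unfamiliar with the setting, the following is a minimal sketch of threshold search in Hamming space together with a generic pigeonhole-principle candidate filter of the kind the abstract refers to. It is not the authors' counting-based strategy, which the abstract does not specify in detail. The function names (`hamming`, `brute_force_search`, `split`, `pigeonhole_search`), the use of Python integers as binary codes, and the equal-width partitioning are all assumptions made for illustration.

```python
from collections import defaultdict


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which a and b differ."""
    return bin(a ^ b).count("1")


def brute_force_search(data, query, tau):
    """Baseline: compute the Hamming distance to every object."""
    return [x for x in data if hamming(x, query) <= tau]


def split(code: int, n_bits: int, m: int):
    """Split an n_bits-wide code into m contiguous, near-equal chunks."""
    base, extra = divmod(n_bits, m)
    parts, shift = [], 0
    for i in range(m):
        width = base + (1 if i < extra else 0)
        parts.append((code >> shift) & ((1 << width) - 1))
        shift += width
    return parts


def pigeonhole_search(data, query, tau, n_bits):
    """Candidate generation via the pigeonhole principle, then verification.

    If two codes differ in at most tau bits and each code is split into
    tau + 1 chunks, at least one chunk must match exactly. Only objects
    sharing a chunk with the query are verified.
    """
    m = tau + 1
    # Hypothetical in-memory index: one hash table per chunk position.
    index = [defaultdict(list) for _ in range(m)]
    for x in data:
        for i, part in enumerate(split(x, n_bits, m)):
            index[i][part].append(x)

    # Candidate generation: union of the buckets hit by the query's chunks.
    candidates = set()
    for i, part in enumerate(split(query, n_bits, m)):
        candidates.update(index[i].get(part, ()))

    # Verification: keep only candidates truly within the threshold.
    return sorted(x for x in candidates if hamming(x, query) <= tau)
```

In this sketch the filter returns the same answers as the brute-force scan while verifying only the objects that share at least one chunk with the query. In practice the index would be built once and reused across queries rather than rebuilt per call.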