模糊搜索的方法

Ynov

于 2025-06-16 08:23:19 发布

阅读量1k

点赞数 25

CC 4.0 BY-SA版权

本文链接：https://blog.csdn.net/2503_91259466/article/details/148681317

模糊搜索的实现方法

模糊搜索(Fuzzy Search)是一种近似匹配的搜索技术，能够在用户输入不完整或不准确时仍能找到相关结果。以下是几种常见的模糊搜索实现方法：

一、基础模糊搜索方法

1. 通配符搜索

使用 * 或 % 作为通配符
示例："app*" 匹配 "apple", "application" 等

2. 包含搜索

检查目标字符串是否包含搜索词
示例："ppl" 匹配 "apple"

javascript

复制

下载

function containsSearch(text, query) {
  return text.toLowerCase().includes(query.toLowerCase());
}

二、高级模糊搜索算法

1. Levenshtein距离（编辑距离）

计算将一个字符串转换成另一个字符串所需的最少编辑操作次数
操作包括：插入、删除、替换字符

function levenshteinDistance(a, b) {
  const matrix = [];
  for (let i = 0; i <= b.length; i++) matrix[i] = [i];
  for (let j = 0; j <= a.length; j++) matrix[0][j] = j;
  
  for (let i = 1; i <= b.length; i++) {
    for (let j = 1; j <= a.length; j++) {
      const cost = a[j-1] === b[i-1] ? 0 : 1;
      matrix[i][j] = Math.min(
        matrix[i-1][j] + 1,    // 删除
        matrix[i][j-1] + 1,    // 插入
        matrix[i-1][j-1] + cost // 替换
      );
    }
  }
  return matrix[b.length][a.length];
}

2. N-gram相似度

将字符串分割为长度为N的连续字符序列
比较两个字符串的N-gram集合的相似度

function getNGrams(str, n) {
  const grams = [];
  for (let i = 0; i <= str.length - n; i++) {
    grams.push(str.substr(i, n));
  }
  return grams;
}

function nGramSimilarity(a, b, n = 2) {
  const aGrams = getNGrams(a, n);
  const bGrams = getNGrams(b, n);
  const common = aGrams.filter(g => bGrams.includes(g));
  return common.length / Math.max(aGrams.length, bGrams.length);
}

3. 模糊匹配评分算法

基于字符出现顺序、连续匹配等给出评分
如Fuse.js使用的算法

三、实用实现方案

1. 使用现有库

Fuse.js - 轻量级模糊搜索库

const fuse = new Fuse(list, { keys: ['name'] });
const result = fuse.search('appel');

Lunr.js - 类似Solr的客户端搜索库

2. 数据库模糊搜索

SQL:

SELECT * FROM products WHERE name LIKE '%app%';

MongoDB:

db.products.find({ name: { $regex: /app/i } });

3. 前端实时模糊搜索实现

function fuzzySearch(items, query) {
  const queryLower = query.toLowerCase();
  return items.filter(item => {
    const itemLower = item.toLowerCase();
    let queryIndex = 0;
    let itemIndex = 0;
    
    while (itemIndex < itemLower.length && queryIndex < queryLower.length) {
      if (itemLower[itemIndex] === queryLower[queryIndex]) {
        queryIndex++;
      }
      itemIndex++;
    }
    
    return queryIndex === queryLower.length;
  });
}