IT技术 - JavaScript 中如何检查字符串是否包含子字符串？ - 吾爱随笔录

JavaScript 中如何检查字符串是否包含子字符串？

IT技术 javascript string substring string-matching

2021-01-05 01:21:37

通常我会期待一种String.contains()方法，但似乎没有。

检查这个的合理方法是什么？

3个回答

ECMAScript 6 引入String.prototype.includes：

const string = "foo";
const substring = "oo";

console.log(string.includes(substring)); // true

includes 但是没有 Internet Explorer 支持。在 ECMAScript 5 或更旧的环境中，使用String.prototype.indexOf，当找不到子字符串时返回 -1：

var string = "foo";
var substring = "oo";

console.log(string.indexOf(substring) !== -1); // true

虽然这是一个很好的答案，并且 OP 从未要求进行“区分大小写”的搜索，但应注意includes执行区分大小写的搜索。

2021-02-18 01:21:37

包括为空 substring 返回 true 。

2021-02-20 01:21:37

@Aashiq：是的，空字符串是每个字符串的子字符串。

2021-03-08 01:21:37

String.prototype.includesES6中有一个：

"potato".includes("to");
> true

请注意，这在 Internet Explorer 或其他一些没有或不支持 ES6 的旧浏览器中不起作用。为了让它在旧浏览器中工作，你可能希望使用像Babel这样的转译器，像es6-shim这样的 shim 库，或者来自 MDN 的这个polyfill：

if (!String.prototype.includes) {
  String.prototype.includes = function(search, start) {
    'use strict';
    if (typeof start !== 'number') {
      start = 0;
    }

    if (start + search.length > this.length) {
      return false;
    } else {
      return this.indexOf(search, start) !== -1;
    }
  };
}

此外，检查number无法像includes. 示例：es6 包含"abc".includes("ab", "1")此polyfill 的返回 false将返回 true

2021-02-28 01:21:37

只是好奇，为什么你需要检查长度？在那种情况下 IE 是否会失败？

2021-03-08 01:21:37

另一种选择是KMP（Knuth-Morris-Pratt）。

KMP 算法在最坏情况下的 O( n + m ) 时间内搜索长度为n 的字符串中的长度为m 的子串，相比于简单算法的 O( n ⋅ m )的最坏情况，因此使用 KMP 可能如果您关心最坏情况的时间复杂度，那是合理的。

这是 Nayuki 项目的 JavaScript 实现，摘自https://www.nayuki.io/res/knuth-morris-pratt-string-matching/kmp-string-matcher.js：

// Searches for the given pattern string in the given text string using the Knuth-Morris-Pratt string matching algorithm.
// If the pattern is found, this returns the index of the start of the earliest match in 'text'. Otherwise -1 is returned.

function kmpSearch(pattern, text) {
  if (pattern.length == 0)
    return 0; // Immediate match

  // Compute longest suffix-prefix table
  var lsp = [0]; // Base case
  for (var i = 1; i < pattern.length; i++) {
    var j = lsp[i - 1]; // Start by assuming we're extending the previous LSP
    while (j > 0 && pattern.charAt(i) != pattern.charAt(j))
      j = lsp[j - 1];
    if (pattern.charAt(i) == pattern.charAt(j))
      j++;
    lsp.push(j);
  }

  // Walk through text string
  var j = 0; // Number of chars matched in pattern
  for (var i = 0; i < text.length; i++) {
    while (j > 0 && text.charAt(i) != pattern.charAt(j))
      j = lsp[j - 1]; // Fall back in the pattern
    if (text.charAt(i) == pattern.charAt(j)) {
      j++; // Next char matched, increment position
      if (j == pattern.length)
        return i - (j - 1);
    }
  }
  return -1; // Not found
}

console.log(kmpSearch('ays', 'haystack') != -1) // true
console.log(kmpSearch('asdf', 'haystack') != -1) // false

不对这种方法提出任何质疑……但是为什么要在有includes或的地方实施 KMP indexOf。（虽然那些可能使用 KMP 的底层实现......不确定）

2021-02-15 01:21:37

@wz366 KMP 提供 O(n)，剩下的呢？任何的想法？

2021-02-25 01:21:37

KMP 在这里提供线性 O(n) 性能。

2021-02-26 01:21:37

如果这是为了提高速度，如果您将其替换.charAt(i)为[i]以避免额外的函数调用，它可能会运行得更快。

2021-03-01 01:21:37

其它你可能感兴趣的问题

上一篇如何将 JavaScript 变量传递给 PHP？下一篇JavaScript .prototype 如何工作？