SearchTextProcessorInterface::PREG_CLASS_CJK
Same name in other branches
- 9 core/modules/search/src/SearchTextProcessorInterface.php \Drupal\search\SearchTextProcessorInterface::PREG_CLASS_CJK
- 11.x core/modules/search/src/SearchTextProcessorInterface.php \Drupal\search\SearchTextProcessorInterface::PREG_CLASS_CJK
Matches CJK (Chinese, Japanese, Korean) letter-like characters.
This list is derived from the "East Asian Scripts" section of http://www.unicode.org/charts/index.html, as well as a comment on http://unicode.org/reports/tr11/tr11-11.html listing some character ranges that are reserved for additional CJK ideographs.
The character ranges do not include numbers, punctuation, or symbols, since these are handled separately in search. Note that radicals and strokes are considered symbols. (See http://www.unicode.org/Public/UNIDATA/extracted/DerivedGeneralCategory…)
See also
\Drupal\search\SearchTextProcessor::expandCjk()
File
-
core/
modules/ search/ src/ SearchTextProcessorInterface.php, line 59
Class
- SearchTextProcessorInterface
- Processes search text for indexing.
Namespace
Drupal\searchCode
const PREG_CLASS_CJK = '\\x{1100}-\\x{11FF}\\x{3040}-\\x{309F}\\x{30A1}-\\x{318E}' . '\\x{31A0}-\\x{31B7}\\x{31F0}-\\x{31FF}\\x{3400}-\\x{4DBF}\\x{4E00}-\\x{9FCF}' . '\\x{A000}-\\x{A48F}\\x{A4D0}-\\x{A4FD}\\x{A960}-\\x{A97F}\\x{AC00}-\\x{D7FF}' . '\\x{F900}-\\x{FAFF}\\x{FF21}-\\x{FF3A}\\x{FF41}-\\x{FF5A}\\x{FF66}-\\x{FFDC}' . '\\x{20000}-\\x{2FFFD}\\x{30000}-\\x{3FFFD}';
Buggy or inaccurate documentation? Please file an issue. Need support? Need help programming? Connect with the Drupal community.