google.appengine.api.search.unicode_util module

Summary

Utility methods related to Unicode.

Contents

google.appengine.api.search.unicode_util.LimitUnicode(s)

Replaces Unicode characters above 0xFFFF with surrogate pairs.

Returns a version of the string argument with all Unicode characters above 0xFFFF (those from the supplementary planes) replaced with the corresponding UTF-16 surrogate pairs. If there are no such characters, returns the same string instance.

See https://wikipedia.org/wiki/UTF-16#U.2B10000_to_U.2B10FFFF

Parameters
  • s – a string, possibly of unicode type, to be converted if necessary.

Returns

Unicode string with surrogate pairs, or the argument unmodified.
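
Example

The surrogate-pair arithmetic described above can be sketched in plain Python 3. This is an illustration of the UTF-16 rule only, not the module's actual implementation; the names to_surrogate_pair and limit_unicode_sketch are hypothetical and are not part of google.appengine.api.search.unicode_util.

```python
def to_surrogate_pair(code_point):
    """Split a code point in U+10000..U+10FFFF into (high, low) surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("code point is outside U+10000..U+10FFFF")
    offset = code_point - 0x10000          # 20-bit value
    high = 0xD800 + (offset >> 10)         # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)        # bottom 10 bits
    return high, low


def limit_unicode_sketch(s):
    """Hypothetical stand-in that mirrors the documented behaviour."""
    # Return the same instance when nothing needs replacing.
    if all(ord(ch) <= 0xFFFF for ch in s):
        return s
    parts = []
    for ch in s:
        cp = ord(ch)
        if cp > 0xFFFF:
            high, low = to_surrogate_pair(cp)
            parts.append(chr(high) + chr(low))
        else:
            parts.append(ch)
    return "".join(parts)


plain = "abc"
converted = limit_unicode_sketch("a\U0001F600")
print(limit_unicode_sketch(plain) is plain)
print([hex(ord(ch)) for ch in converted])
```

In this sketch, U+1F600 becomes the pair 0xD83D, 0xDE00, so the two-character input yields a three-element result, while a string with no characters above 0xFFFF is returned unchanged.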