-
utf8.php
-
-
UTF8
-
Put the current directory in this constant
-
utf8_accents_to_ascii
-
Replace accented UTF-8 characters by unaccented ASCII-7 "equivalents".
-
utf8_is_ascii
-
Tests whether a string contains only 7bit ASCII bytes.
-
utf8_is_ascii_ctrl
-
Tests whether a string contains only 7bit ASCII bytes with device control codes omitted. The device control codes can be found on the second table here: http://www.w3schools.com/tags/ref_ascii.asp
-
utf8_strip_ascii_ctrl
-
Strip out device control codes in the ASCII range
-
utf8_strip_non_ascii
-
Strip out all non-7bit ASCII bytes If you need to transmit a string to system which you know can only support 7bit ASCII, you could use this function.
-
utf8_strip_non_ascii_ctrl
-
Strip out all non 7bit ASCII bytes and ASCII device control codes.
-
UTF8_BAD_5OCTET
-
Return code from utf8_bad_identify() when a five octet sequence is detected.
-
UTF8_BAD_6OCTET
-
Return code from utf8_bad_identify() when a six octet sequence is detected.
-
utf8_bad_explain
-
Takes a return code from utf8_bad_identify() are returns a message (in English) explaining what the problem is.
-
utf8_bad_find
-
Locates the first bad byte in a UTF-8 string returning it's
-
utf8_bad_findall
-
Locates all bad bytes in a UTF-8 string and returns a list of their
-
utf8_bad_identify
-
Reports on the type of bad byte found in a UTF-8 string. Returns a
-
UTF8_BAD_NONSHORT
-
Return code from utf8_bad_identify().
-
utf8_bad_replace
-
Replace bad bytes with an alternative character - ASCII character
-
UTF8_BAD_SEQID
-
Return code from utf8_bad_identify().
-
UTF8_BAD_SEQINCOMPLETE
-
Return code from utf8_bad_identify().
-
utf8_bad_strip
-
Strips out any bad bytes from a UTF-8 string and returns the rest
-
UTF8_BAD_SURROGATE
-
Return code from utf8_bad_identify().
-
UTF8_BAD_UNIOUTRANGE
-
Return code from utf8_bad_identify().
-
utf8_byte_position
-
Given a string and a character index in the string, in
-
utf8_locate_current_chr
-
Given a string and any byte index, returns the byte index
-
utf8_locate_next_chr
-
Given a string and any byte index, returns the byte index of the start of the next UTF-8 character, relative to supplied position. If the next character begins at the same place as the supplied byte index, that byte index will be returned.
-
ucfirst.php
-
-
ucwords.php
-
-
UTF8_CORE
-
Define UTF8_CORE as required
-
utf8_ireplace
-
UTF-8 aware alternative to str_ireplace
-
utf8_ltrim
-
UTF-8 aware replacement for ltrim()
-
utf8_ord
-
UTF-8 aware alternative to ord
-
utf8_rtrim
-
UTF-8 aware replacement for rtrim()
-
utf8_strcasecmp
-
UTF-8 aware alternative to strcasecmp
-
utf8_strcspn
-
UTF-8 aware alternative to strcspn
-
utf8_stristr
-
UTF-8 aware alternative to stristr
-
utf8_strlen
-
Wrapper round mb_strlen
-
utf8_strpos
-
Assumes mbstring internal encoding is set to UTF-8
-
utf8_strrev
-
UTF-8 aware alternative to strrev
-
utf8_strrpos
-
Assumes mbstring internal encoding is set to UTF-8
-
utf8_strspn
-
UTF-8 aware alternative to strspn
-
utf8_strtolower
-
Assumes mbstring internal encoding is set to UTF-8
-
utf8_strtoupper
-
Assumes mbstring internal encoding is set to UTF-8
-
utf8_str_pad
-
Replacement for str_pad. $padStr may contain multi-byte characters.
-
utf8_str_split
-
UTF-8 aware alternative to str_split
-
utf8_substr
-
Assumes mbstring internal encoding is set to UTF-8
-
utf8_substr_replace
-
UTF-8 aware substr_replace.
-
utf8_trim
-
UTF-8 aware replacement for trim()
-
utf8_ucfirst
-
UTF-8 aware alternative to ucfirst
-
utf8_ucwords
-
UTF-8 aware alternative to ucwords
-
utf8_ucwords_callback
-
Callback function for preg_replace_callback call in utf8_ucwords
-
unicode.php
-
-
utf8_from_unicode
-
Takes an array of ints representing the Unicode characters and returns a UTF-8 string. Astral planes are supported ie. the ints in the input can be > 0xFFFF. Occurrances of the BOM are ignored. Surrogates are not allowed.
-
utf8_to_unicode
-
Takes an UTF-8 string and returns an array of ints representing the Unicode characters. Astral planes are supported ie. the ints in the output can be > 0xFFFF. Occurrances of the BOM are ignored. Surrogates are not allowed.
-
utf8_is_word_chars
-
Checks a string for whether it contains only word characters. This
-
utf8_specials_pattern
-
Used internally. Builds a PCRE pattern from the $UTF8_SPECIAL_CHARS
-
utf8_strip_specials
-
Removes special characters (nonalphanumeric) from a UTF-8 string
-
utf8_compliant
-
Tests whether a string complies as UTF-8. This will be much
-
utf8_is_valid
-
Tests a string as to whether it's valid UTF-8 and supported by the