Welcome to mirror list, hosted at ThFree Co, Russian Federation.

cygwin.com/git/newlib-cygwin.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorCorinna Vinschen <corinna@vinschen.de>2009-04-01 13:49:25 +0400
committerCorinna Vinschen <corinna@vinschen.de>2009-04-01 13:49:25 +0400
commitb4bf8c7064f0934180e6723c7b2ae91cf2c5909e (patch)
tree25f308a6c9833b448d2079b83d107ffbd40d0c4d /winsup/cygwin/strfuncs.cc
parentd9749de7664550775f1cf9b67f4a708a5d56f685 (diff)
* strfuncs.cc (sys_cp_mbstowcs): Check if ASCII SO is followed by at
least two chars and the next byte is a valid UTF-8 start byte before trying to convert the followup bytes as UTF-8 sequence.
Diffstat (limited to 'winsup/cygwin/strfuncs.cc')
-rw-r--r--winsup/cygwin/strfuncs.cc8
1 files changed, 6 insertions, 2 deletions
diff --git a/winsup/cygwin/strfuncs.cc b/winsup/cygwin/strfuncs.cc
index bb3682225..2af6bf3ca 100644
--- a/winsup/cygwin/strfuncs.cc
+++ b/winsup/cygwin/strfuncs.cc
@@ -425,8 +425,12 @@ sys_cp_mbstowcs (UINT cp, PWCHAR dst, size_t dlen, const char *src, size_t nms)
len = (size_t)-1;
while (len > 0 && nms > 0)
{
- /* ASCII SO. Convert following UTF-8 sequence (if not UTF-8 anyway). */
- if (*pmbs == 0x0e && *charset != 'U'/*TF-8*/)
+ /* ASCII SO. Sanity check: If this is a lead SO byte for a following
+ UTF-8 sequence, there must be at least two more bytes left, and the
+ next byte must be a valid UTF-8 start byte. If the charset isn't
+ UTF-8 anyway, try to convert the following bytes as UTF-8 sequence. */
+ if (*pmbs == 0x0e && nms > 2 && *(unsigned char *) (pmbs + 1) >= 0xc2
+ && *(unsigned char *) (pmbs + 1) <= 0xf4 && *charset != 'U'/*TF-8*/)
{
pmbs++;
--nms;