qtbase/serialization at 8764a0c79db30f61cc7f49f80d3bbf7150e1c90f - qtbase - Gitea : Git Mirror

1berry/qtbase

History

Thiago Macieira 8764a0c79d CBOR: fix sorting of UTF16-to-UTF16 strings

This amends commit 394788c68efacdec2676988b4b4ff207b20557f2 (its
ChangeLog applies to this commit too). That fixed sorting of UTF8-to-
UTF16, but when adding more unit tests, I've discovered that some UTF-16
strings also sorted incorrectly. There were two problems:

First, we were assuming that we could rely on the UTF-16 length as a
proxy for the UTF-8 one, but that's not true for some cases:
* both 1-, 2- and 3-codepoint UTF-8 sequences are 1 codepoint
  in UTF-16, so some strings would have identical UTF-16 length
* 4-codepoint UTF-8 sequences shrink to 2-codepoint UTF-16 ones
  (2:1) but 3-codepoint UTF-8 sequences shrink to 1 (3:1), so
  some strings would be longer in UTF-16 but shorter in UTF-8.

Second, QtPrivate::compareStrings performs UTF-16 codepoint comparisons
not Unicode character ones, so surrogate pairs were sorting before
U+E000 to U+FFFF.

To fix all of this, we need to decode the UTF-16 string into UTF-32 and
calculate the length of that in UTF-8 to be sure we have the sorting
order right.

Since this is a slight behavior change with a performance penalty, I am
choosing to backport only to 6.7. The penalty mostly does not apply to
6.8 due to commit 61556627f25e7c7acbfcc5e54127a392b5239977.

Change-Id: If1bf59ecbe014b569ba1fffd17c4c4ddcc874aac
Reviewed-by: Ivan Solovev <ivan.solovev@qt.io>
(cherry picked from commit d4c7da9a07dc1434692fe08a61ba22c794574c4f)
Reviewed-by: Thiago Macieira <thiago.macieira@intel.com>

2024-04-23 19:03:23 -07:00

..

.gitignore

…

make-xml-parser.sh

Correct license for tools files

2024-03-08 18:38:46 +00:00

qcborarray.cpp

Rename Convert Example to Serialization Converter

2023-10-30 18:19:57 +02:00

qcborarray.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qcborcommon_p.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qcborcommon.cpp

Rename Convert Example to Serialization Converter

2023-10-30 18:19:57 +02:00

qcborcommon.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qcbordiagnostic.cpp

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qcbormap.cpp

Rename Convert Example to Serialization Converter

2023-10-30 18:19:57 +02:00

qcbormap.h

QCborMap::ConstIterator and Iterator: Add missing destructor

2024-04-11 19:54:14 +00:00

qcborstream.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qcborstreamreader.cpp

QCborStreamReader: rename toStringish() -> readAllStringish()

2024-03-12 15:39:25 +00:00

qcborstreamreader.h

QCborStreamReader: rename toStringish() -> readAllStringish()

2024-03-12 15:39:25 +00:00

qcborstreamwriter.cpp

QCborStreamWriter: correct the QCbor{Array,Map} size limitations

2023-11-24 12:42:12 -08:00

qcborstreamwriter.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qcborvalue_p.h

Implement QCborContainerPrivate::compact()

2024-02-05 23:15:07 +00:00

qcborvalue.cpp

CBOR: fix sorting of UTF16-to-UTF16 strings

2024-04-23 19:03:23 -07:00

qcborvalue.h

Q*ValueRef: suppress MSVC warning on deriving from non-exported base

2023-06-18 20:06:57 +00:00

qdatastream_p.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qdatastream.cpp

QDataStream: Turn QDataStreamSizes enum into static contexpr quint32

2024-02-23 15:39:53 +00:00

qdatastream.h

QDataStream: Turn QDataStreamSizes enum into static contexpr quint32

2024-02-23 15:39:53 +00:00

qjson_p.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qjsonarray.cpp

QJsonArray: symmetrize QDataStream op>>/<<

2024-01-31 06:51:41 +00:00

qjsonarray.h

QVariant: make many more QtCore types nothrow-copyable

2022-07-30 07:27:56 -07:00

qjsoncbor.cpp

Long live Q_UNREACHABLE_RETURN()!

2022-10-15 22:11:47 +02:00

qjsondocument.cpp

Rename the JSON Save Game Example to Saving a Game to File

2023-10-20 11:19:48 +02:00

qjsondocument.h

Replace usages of Q_CLANG_QDOC with Q_QDOC

2022-10-21 09:48:36 +02:00

qjsonobject.cpp

QCbor/QJson: s/QPair/std::pair/

2023-12-14 03:01:40 +00:00

qjsonobject.h

QCbor/QJson: s/QPair/std::pair/

2023-12-14 03:01:40 +00:00

qjsonparser_p.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qjsonparser.cpp

corelib: serialization - fix macos unity builds

2024-02-06 07:18:23 +00:00

qjsonvalue.cpp

Rename the JSON Save Game Example to Saving a Game to File

2023-10-20 11:19:48 +02:00

qjsonvalue.h

Q*ValueRef: suppress MSVC warning on deriving from non-exported base

2023-06-18 20:06:57 +00:00

qjsonwriter_p.h

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qjsonwriter.cpp

QJsonWriter: general cleanup

2023-05-01 21:52:22 +02:00

qtextstream_p.h

Port from container::count() and length() to size() - V5

2022-11-03 14:59:24 +01:00

qtextstream.cpp

Doc: Fix typo

2023-04-18 13:39:26 +00:00

qtextstream.h

Remove unneeded include of qfloat16.h

2023-02-28 19:03:53 +01:00

qxmlstream_p.h

QXmlStreamReader: Raise error on unexpected tokens

2023-07-10 22:44:06 +02:00

qxmlstream.cpp

QXmlStreamWriter: fix attempts to write bad QStrings

2024-04-19 19:11:44 +00:00

qxmlstream.g

QXmlStream: fix generating ERROR enum value

2023-07-06 06:28:22 +03:00

qxmlstream.h

Guard xmlstream header in a source-compatible way

2023-07-19 11:56:52 +02:00

qxmlstreamgrammar_p.h

QXmlStream: fix generating ERROR enum value

2023-07-06 06:28:22 +03:00

qxmlstreamgrammar.cpp

Use SPDX license identifiers

2022-05-16 16:37:38 +02:00

qxmlstreamparser_p.h

QXmlStreamReader: make fastScanName() indicate parsing status to callers

2023-06-28 00:11:21 +03:00

qxmlutils_p.h

QDom: Stop treating non-BMP characters as invalid

2022-06-20 21:29:04 +00:00

qxmlutils.cpp

Fix some narrowing conversion warnings

2023-04-08 13:24:04 +02:00