From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 38483 invoked by alias); 21 May 2018 18:03:45 -0000 Mailing-List: contact gdb-patches-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: gdb-patches-owner@sourceware.org Received: (qmail 38446 invoked by uid 89); 21 May 2018 18:03:44 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-2.6 required=5.0 tests=AWL,BAYES_00,MIME_BASE64_BLANKS,RCVD_IN_DNSWL_LOW,SPF_PASS autolearn=ham version=3.3.2 spammy=arrive, latin, Hx-languages-length:2734, letter X-HELO: esa2.dell-outbound.iphmx.com Received: from esa2.dell-outbound.iphmx.com (HELO esa2.dell-outbound.iphmx.com) (68.232.149.220) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Mon, 21 May 2018 18:03:42 +0000 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: =?us-ascii?q?A2EfAQC6CQNbmGKa6ERcGgEBAQEBAgEBA?= =?us-ascii?q?QEIAQEBAYQWgTMKg2uUd4F5gQ+TNoF4C4RsAhqCACE2FgECAQEBAQEBAgEBAhA?= =?us-ascii?q?BAQEBAQgLCwYoL4I1IoJTAQEBAwEjEUUFCwIBCBQBAwICJgICAkUQAgQOBRuDB?= =?us-ascii?q?4F5CKgNghyIQYIPCQF/hyyCE4EygjQ1hHMXIxKCNDCCJAKMIIwsBwKOV4x9kHe?= =?us-ascii?q?BJSMHgX1wegGCGIIujiBvjgsrgQGBGAEB?= X-IPAS-Result: =?us-ascii?q?A2EfAQC6CQNbmGKa6ERcGgEBAQEBAgEBAQEIAQEBAYQWgTM?= =?us-ascii?q?Kg2uUd4F5gQ+TNoF4C4RsAhqCACE2FgECAQEBAQEBAgEBAhABAQEBAQgLCwYoL?= =?us-ascii?q?4I1IoJTAQEBAwEjEUUFCwIBCBQBAwICJgICAkUQAgQOBRuDB4F5CKgNghyIQYI?= =?us-ascii?q?PCQF/hyyCE4EygjQ1hHMXIxKCNDCCJAKMIIwsBwKOV4x9kHeBJSMHgX1wegGCG?= =?us-ascii?q?IIujiBvjgsrgQGBGAEB?= Received: from esa4.dell-outbound2.iphmx.com ([68.232.154.98]) by esa2.dell-outbound.iphmx.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 21 May 2018 13:03:19 -0500 From: Received: from ausc60pc101.us.dell.com ([143.166.85.206]) by esa4.dell-outbound2.iphmx.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 22 May 2018 00:03:19 +0600 X-LoopCount0: from 10.166.136.215 X-DLP: 
DLP_GlobalPCIDSS To: CC: , , Subject: Re: support C/C++ identifiers named with non-ASCII characters Date: Mon, 21 May 2018 18:34:00 -0000 Message-ID: References: <9418d4f0-f22a-c587-cc34-2fa67afbd028@zjz.name> <8c8af079-dbb8-207b-5edf-86b99e9f5db8@simark.ca> <834lj1f0ne.fsf@gnu.org> In-Reply-To: <834lj1f0ne.fsf@gnu.org> Content-Type: text/plain; charset="utf-8" Content-ID: <7F9AB9E434CC124E955699A93F98ED0C@dell.com> Content-Transfer-Encoding: base64 MIME-Version: 1.0 X-SW-Source: 2018-05/txt/msg00500.txt.bz2 DQoNCj4gT24gTWF5IDIxLCAyMDE4LCBhdCAxMjoxMiBQTSwgRWxpIFphcmV0 c2tpaSA8ZWxpekBnbnUub3JnPiB3cm90ZToNCj4gDQo+PiBGcm9tOiA8UGF1 bC5Lb25pbmdAZGVsbC5jb20+DQo+PiBDQzogPHpqekB6anoubmFtZT4sIDxn ZGItcGF0Y2hlc0Bzb3VyY2V3YXJlLm9yZz4NCj4+IERhdGU6IE1vbiwgMjEg TWF5IDIwMTggMTQ6MTI6MTIgKzAwMDANCj4+IA0KPj4+IEdpdmVuIHVubGlt aXRlZCB0aW1lLCB3b3VsZCB0aGUgcmlnaHQgc29sdXRpb24gYmUgdG8gdXNl IGEgbGliIHRvIHBhcnNlIHRoZQ0KPj4+IHN0cmluZyBhcyB1dGYtOCwgYW5k IHJlamVjdCBzdHJpbmdzIHRoYXQgYXJlIG5vdCB2YWxpZCB1dGYtOD8NCj4+ IA0KPj4gVGhpcyBzb3VuZHMgbGlrZSBhIHNjZW5hcmlvIHdoZXJlICJzdHJp bmdwcmVwIiBpcyBoZWxwZnVsIChvciBuZWNlc3NhcnkpLiAgSXQgdmFsaWRh dGVzIHN0cmluZ3MgdG8gYmUgdmFsaWQgdXRmLTgsIGNhbiBjaGVjayB0aGF0 IHRoZXkgb2JleSBjZXJ0YWluIHJ1bGVzIChzdWNoIGFzICJ3b3JkIGVsZW1l bnRzIG9ubHkiIHdoaWNoIHJlamVjdHMgcHVuY3R1YXRpb24gYW5kIHRoZSBs aWtlKSwgYW5kIGNhbiBjb252ZXJ0IHRoZW0gdG8gYSBjYW5vbmljYWwgZm9y bSBzbyBlcXVhbCBzdHJpbmdzIG1hdGNoIHdoZXRoZXIgdGhleSBhcmUgZW5j b2RlZCB0aGUgc2FtZSBvciBub3QuDQo+IA0KPiBJcyBpdCBhIGZhY3QgdGhh dCBub24tQVNDSUkgaWRlbnRpZmllcnMgbXVzdCBiZSBlbmNvZGVkIGluIFVU Ri04LCBhbmQNCj4gY2FuIG5vdCBpbmNsdWRlIGludmFsaWQgVVRGLTggc2Vx dWVuY2VzPw0KDQpFbmNvZGluZyBpcyBhIEkvTyBxdWVzdGlvbi4gICJVVEYt OCIgYW5kICJVbmljb2RlIiBhcmUgb2Z0ZW4gbWl4ZWQgdXAsIGJ1dCB0aGV5 IGFyZSBkaXN0aW5jdC4gIFVuaWNvZGUgaXMgYSBjaGFyYWN0ZXIgc2V0LCBp biB3aGljaCBlYWNoIGNoYXJhY3RlciBoYXMgYSBudW1lcmljIGlkZW50aWZp Y2F0aW9uLiAgRm9yIGV4YW1wbGUsIOW8tSBpcyBVbmljb2RlIGNoYXJhY3Rl ciBudW1iZXIgMjQzNzMgKDB4NWYzNSkuDQoNClVURi04IGlzIG9uZSBvZiBz 
ZXZlcmFsIHdheXMgdG8gZW5jb2RlIFVuaWNvZGUgY2hhcmFjdGVycyBhcyBh IGJ5dGUgc3RyZWFtLiAgVGhlIFVURi04IGVuY29kaW5nIG9mIOW8tSBpcyBl NSBiYyBiNS4NCg0KSSBkb24ndCBrbm93IHdoYXQgdGhlIEMvQysrIHN0YW5k YXJkcyBzYXkgYWJvdXQgbm9uLUFTQ0lJIGlkZW50aWZpZXJzLiAgSSBhc3N1 bWUgdGhleSBhcmUgc3RhdGVkIHRvIGJlIFVuaWNvZGUsIGFuZCBwcmVzdW1h Ymx5IHNwZWNpZmljIFVuaWNvZGUgY2hhcmFjdGVyIGNsYXNzZXMuICBTbyB0 aGVyZSBhcmUgc29tZSBzZXF1ZW5jZXMgb2YgVW5pY29kZSBjaGFyYWN0ZXJz IHRoYXQgYXJlIHZhbGlkIGlkZW50aWZpZXJzLCB3aGlsZSBvdGhlcnMgYXJl IG5vdCAtLSBleGFjdGx5IGFzICJhYmMiIGlzIGEgdmFsaWQgQVNDSUkgaWRl bnRpZmllciB3aGlsZSAiYUBiYyIgaXMgbm90Lg0KDQpBIHNlcGFyYXRlIHF1 ZXN0aW9uIGlzIHRoZSBlbmNvZGluZyBvZiBmaWxlcy4gIFRoZSBlbmNvZGlu ZyBydWxlIGNvdWxkIGJlIHRoYXQgVVRGLTggaXMgcmVxdWlyZWQgLS0gb3Ig dGhhdCB0aGUgZW5jb2RpbmcgaXMgc2VsZWN0YWJsZS4gIFRoZXJlIGFsc28g aGFzIHRvIGJlIGFuIGVuY29kaW5nIGluIG91dHB1dCBmaWxlcyAoZGVidWcg ZGF0YSBmb3IgZXhhbXBsZSkuICBBbmQgd2hlbiBzdHJpbmdzIGFyZSBlbnRl cmVkIGF0IHRoZSBHREIgdXNlciBpbnRlcmZhY2UsIHRoZXkgYXJyaXZlIGlu IHNvbWUgZW5jb2RpbmcuICBGb3IgYWxsIHRoZXNlLCBVVEYtOCBpcyBhIGxv Z2ljYWwgYW5zd2VyLg0KDQpOb3QgYWxsIGJ5dGUgc3RyaW5ncyBhcmUgdmFs aWQgVVRGLTggc3RyaW5ncy4gIFdoZW4gYSBieXRlIHN0cmluZyBpcyBkZWxp dmVyZWQgZnJvbSB0aGUgb3V0c2lkZSwgaXQgbWFrZXMgc2Vuc2UgdG8gdmFs aWRhdGUgaWYgaXQncyBhIHZhbGlkIGVuY29kaW5nIGJlZm9yZSBpdCBpcyB1 c2VkLiAgT3IgeW91IGNhbiBhc3N1bWUgdGhhdCBpbnB1dHMgYXJlIHZhbGlk IGFuZCByZWx5IG9uICJzeW1ib2wgbm90IGZvdW5kIiBhcyB0aGUgZ2VuZXJh bCB3YXkgdG8gaGFuZGxlIGFueXRoaW5nIHRoYXQgZG9lc24ndCBtYXRjaC4g IEZvciBnZGIsIHRoYXQgbWF5IGJlIGdvb2QgZW5vdWdoLg0KDQpZZXQgYW5v dGhlciBpc3N1ZTogZm9yIG1hbnkgY2hhcmFjdGVycywgdGhlcmUgYXJlIG11 bHRpcGxlIHdheXMgdG8gcmVwcmVzZW50IHRoZW0gaW4gVW5pY29kZS4gIEZv ciBleGFtcGxlLCDDvCAobGF0aW4gc21hbGwgbGV0dGVyIHUgd2l0aCBkaWVy ZXNpcykgY2FuIGJlIGNvZGVkIGFzIHRoZSBzaW5nbGUgVW5pY29kZSBjaGFy YWN0ZXIgMHhmYywgb3IgYXMgdGhlIHBhaXIgMHgwMzA4IDB4NzUgKGNvbWJp bmluZyBkaWVyZXNpcywgbGF0aW4gc21hbGwgbGV0dGVyIHUpLiAgVGhlc2Ug YXJlIHN1cHBvc2VkIHRvIGJlIHN5bm9ueW1vdXM7IHdoZW4gZG9pbmcgc3Ry 
aW5nIG1hdGNoZXMsIHlvdSdkIHdhbnQgdGhlbSB0byBiZSB0YWtlbiBhcyBl cXVpdmFsZW50LiAgVGhlIHN0cmluZ3ByZXAgbGlicmFyeSBoZWxwcyB3aXRo IHRoaXMgYnkgb2ZmZXJpbmcgYSBjb252ZXJzaW9uIHRvIGEgc3RhbmRhcmQg Zm9ybSwgYXQgd2hpY2ggcG9pbnQgbWVtY21wIHdpbGwgZ2l2ZSB0aGUgY29y cmVjdCBhbnN3ZXIuDQoNCglwYXVsDQoNCg== >From gdb-patches-return-147483-listarch-gdb-patches=sources.redhat.com@sourceware.org Mon May 21 18:14:28 2018 Return-Path: Delivered-To: listarch-gdb-patches@sources.redhat.com Received: (qmail 53112 invoked by alias); 21 May 2018 18:14:27 -0000 Mailing-List: contact gdb-patches-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: gdb-patches-owner@sourceware.org Delivered-To: mailing list gdb-patches@sourceware.org Received: (qmail 52577 invoked by uid 89); 21 May 2018 18:14:27 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-2.4 required=5.0 tests=AWL,BAYES_00,SPF_PASS autolearn=ham version=3.3.2 spammy=Hx-languages-length:667 X-HELO: eggs.gnu.org Received: from eggs.gnu.org (HELO eggs.gnu.org) (208.118.235.92) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Mon, 21 May 2018 18:14:25 +0000 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fKpJz-0007h7-7l for gdb-patches@sourceware.org; Mon, 21 May 2018 14:14:23 -0400 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:57727) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fKpJy-0007gg-OD; Mon, 21 May 2018 14:14:10 -0400 Received: from [176.228.60.248] (port=3272 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1fKpJy-00031d-59; Mon, 21 May 2018 14:14:10 -0400 Date: Mon, 21 May 2018 19:05:00 -0000 Message-Id: <83tvr0ev0p.fsf@gnu.org> From: Eli Zaretskii To: CC: simark@simark.ca, zjz@zjz.name, gdb-patches@sourceware.org In-reply-to: (Paul.Koning@dell.com) Subject: Re: 
support C/C++ identifiers named with non-ASCII characters Reply-to: Eli Zaretskii References: <9418d4f0-f22a-c587-cc34-2fa67afbd028@zjz.name> <8c8af079-dbb8-207b-5edf-86b99e9f5db8@simark.ca> <834lj1f0ne.fsf@gnu.org> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-IsSubscribed: yes X-SW-Source: 2018-05/txt/msg00501.txt.bz2 Content-length: 608

> From:
> CC: , ,
> Date: Mon, 21 May 2018 18:03:17 +0000
>
> > Is it a fact that non-ASCII identifiers must be encoded in UTF-8, and
> > can not include invalid UTF-8 sequences?
>
> Encoding is a I/O question.

Not necessarily.  I asked that question because scanning a string for
certain ASCII characters using a 'char *' pointer will only work
reliably if the string is in UTF-8 or in some single-byte encoding.
Otherwise, we might find false hits for the delimiters, which are
actually parts of multibyte sequences.
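
To make the "false hits" concern concrete, here is a minimal sketch. It is not GDB code: find_delimiter is a hypothetical helper, and the backslash delimiter and Shift-JIS are chosen only for illustration. It scans bytes one at a time, the way a loop over a 'char *' would, and compares the same one-character identifier encoded in UTF-8 and in Shift-JIS.

```python
# Illustrative sketch only; find_delimiter is a hypothetical helper, not GDB code.

def find_delimiter(data, delimiter=b"\\"):
    """Return every offset where the single-byte ASCII delimiter occurs,
    scanning byte by byte as a C loop over a 'char *' would."""
    return [i for i in range(len(data)) if data[i:i + 1] == delimiter]


# A one-character identifier that contains no backslash at all.
name = "ソ"  # KATAKANA LETTER SO, U+30BD

utf8 = name.encode("utf-8")      # every byte of a multibyte UTF-8 sequence is >= 0x80
sjis = name.encode("shift_jis")  # trail bytes may fall inside the ASCII range

print("UTF-8    ", utf8.hex(), find_delimiter(utf8))
print("Shift-JIS", sjis.hex(), find_delimiter(sjis))
```

In UTF-8 the scan finds nothing, because lead and continuation bytes never overlap with ASCII.  In Shift-JIS the second byte of this character is 0x5C, the same value as an ASCII backslash, so the byte-wise scan reports a delimiter that is not really there.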