From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gdb-return-6976-listarch-gdb=sourceware.cygnus.com@sources.redhat.com>
Received: (qmail 24418 invoked by alias); 29 Nov 2001 00:02:46 -0000
Mailing-List: contact gdb-help@sourceware.cygnus.com; run by ezmlm
Precedence: bulk
List-Subscribe: <mailto:gdb-subscribe@sources.redhat.com>
List-Archive: <http://sources.redhat.com/ml/gdb/>
List-Post: <mailto:gdb@sources.redhat.com>
List-Help: <mailto:gdb-help@sources.redhat.com>, <http://sources.redhat.com/ml/#faqs>
Sender: gdb-owner@sources.redhat.com
Received: (qmail 24268 invoked from network); 29 Nov 2001 00:02:40 -0000
Received: from unknown (HELO gash2.peakpeak.com) (207.174.178.17)
  by hostedprojects.ges.redhat.com with SMTP; 29 Nov 2001 00:02:40 -0000
Received: from creche.cygnus.com (ta0204.peakpeak.com [204.144.244.204])
	by gash2.peakpeak.com (8.9.3/8.9.3) with ESMTP id RAA07071;
	Wed, 28 Nov 2001 17:01:35 -0700
Received: (from tromey@localhost)
	by creche.cygnus.com (8.9.3/8.9.3) id RAA25235;
	Wed, 28 Nov 2001 17:03:46 -0700
To: Eli Zaretskii <eliz@is.elta.co.il>
Cc: Klaus-Georg Adams <Klaus-Georg.Adams@sap.com>, gdb@sources.redhat.com
Subject: Re: displaying wchar_t in gdb
References: <Pine.SUN.3.91.1011126140745.12758B-100000@is>
X-Zippy:  When I met th'POPE back in '58, I scrubbed him with a MILD SOAP
 or DETERGENT for 15 minutes.  He seemed to enjoy it..
X-Attribution:  Tom
Reply-To: tromey@redhat.com
From: Tom Tromey <tromey@redhat.com>
Date: Thu, 22 Nov 2001 14:30:00 -0000
In-Reply-To: Eli Zaretskii's message of "Mon, 26 Nov 2001 14:10:25 +0200 (IST)"
Message-ID: <87u1veigta.fsf@creche.redhat.com>
X-Mailer: Gnus v5.7/Emacs 20.5
X-SW-Source: 2001-11/txt/msg00223.txt.bz2

>>>>> "Eli" == Eli Zaretskii <eliz@is.elta.co.il> writes:

>> What would be a strategy to implement this natively in gdb? Can you
>> tell inside gdb if we are working with wchar_t? Should there be a
>> separate format for this?

Eli> I'm not even sure this is feasible, taking the cross-debugging
Eli> into consideration.  I guess it's possible in native debugging,
Eli> assuming GDB and the debuggee use compatible libraries for wide
Eli> character support, and support the same character sets.

I think it is feasible if you assume first that the host has a
high-powered iconv() implementation (Linux does, other systems are
typically less good -- but there is always libiconv) and second that
the target's wchar_t is a well-known encoding and not some peculiar
thing.

With these assumptions the problem becomes one of telling gdb what
encoding to expect when printing wchar_t strings.  The terminal's
encoding can just come from the current locale.

Eli> If it _is_ possible and feasible, then a special format is
Eli> probably the way to go.

For wchar_t I don't think you need a new `print' format (well maybe to
specify the encoding).  I think a wchar_t string could be printed
based solely on the type, the way we print a char* string right now.

It would also be possible to print multibyte strings this way too.  In
this case you'd want a special format.

This functionality might be mildly useful for Java debugging (for Java
the problem is simpler as the target's encoding is always UCS-2).
Right now I believe we print non-ASCII characters using `\u' escapes.
I haven't yet run into a situation where this is insufficient, but I
suppose it is possible.

Tom


From mboxrd@z Thu Jan  1 00:00:00 1970
From: Tom Tromey <tromey@redhat.com>
To: Eli Zaretskii <eliz@is.elta.co.il>
Cc: Klaus-Georg Adams <Klaus-Georg.Adams@sap.com>, gdb@sources.redhat.com
Subject: Re: displaying wchar_t in gdb
Date: Wed, 28 Nov 2001 16:02:00 -0000
Message-ID: <87u1veigta.fsf@creche.redhat.com>
References: <Pine.SUN.3.91.1011126140745.12758B-100000@is>
X-SW-Source: 2001-11/msg00330.html
Message-ID: <20011128160200.depUixPBSbJvqPec1d51qtn3vYNcfbEJFXwlkAifHRI@z>

>>>>> "Eli" == Eli Zaretskii <eliz@is.elta.co.il> writes:

>> What would be a strategy to implement this natively in gdb? Can you
>> tell inside gdb if we are working with wchar_t? Should there be a
>> separate format for this?

Eli> I'm not even sure this is feasible, taking the cross-debugging
Eli> into consideration.  I guess it's possible in native debugging,
Eli> assuming GDB and the debuggee use compatible libraries for wide
Eli> character support, and support the same character sets.

I think it is feasible if you assume first that the host has a
high-powered iconv() implementation (Linux does, other systems are
typically less good -- but there is always libiconv) and second that
the target's wchar_t is a well-known encoding and not some peculiar
thing.

With these assumptions the problem becomes one of telling gdb what
encoding to expect when printing wchar_t strings.  The terminal's
encoding can just come from the current locale.

Eli> If it _is_ possible and feasible, then a special format is
Eli> probably the way to go.

For wchar_t I don't think you need a new `print' format (well maybe to
specify the encoding).  I think a wchar_t string could be printed
based solely on the type, the way we print a char* string right now.

It would also be possible to print multibyte strings this way too.  In
this case you'd want a special format.

This functionality might be mildly useful for Java debugging (for Java
the problem is simpler as the target's encoding is always UCS-2).
Right now I believe we print non-ASCII characters using `\u' escapes.
I haven't yet run into a situation where this is insufficient, but I
suppose it is possible.

Tom