From: Alan Hayward <Alan.Hayward@arm.com>
To: Pedro Alves <palves@redhat.com>
Cc: Simon Marchi <simon.marchi@polymtl.ca>,
"gdb-patches\\@sourceware.org" <gdb-patches@sourceware.org>,
nd <nd@arm.com>
Subject: Re: [PATCH] AArch64 pauth: Indicate unmasked addresses in backtrace
Date: Wed, 17 Jul 2019 16:07:00 -0000 [thread overview]
Message-ID: <8009C474-AE70-4A5B-A2D9-EB3B90626D95@arm.com> (raw)
In-Reply-To: <9c474f28-30f3-2428-d147-4474471a61ba@redhat.com>
[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset="utf-8", Size: 6732 bytes --]
> On 17 Jul 2019, at 16:18, Pedro Alves <palves@redhat.com> wrote:
>
> On 7/17/19 4:01 PM, Simon Marchi wrote:
>> On 2019-07-17 09:35, Alan Hayward wrote:
>>>> On 17 Jul 2019, at 12:15, Pedro Alves <palves@redhat.com> wrote:
>>>>
>>>> On 7/17/19 9:14 AM, Alan Hayward wrote:
>>>>> Armv8.3-a Pointer Authentication causes the function return address to be
>>>>> obfuscated on entry to some functions. GDB must unmask the link register in
>>>>> order to produce a backtrace.
>>>>>
>>>>> The following patch adds markers of <unmasked> to the bracktrace, to indicate
>>>>> which addresses needed unmasking.
>>>>>
>>>>> For example, consider the following backtrace:
>>>>>
>>>>> (gdb) bt
>>>>> 0 0x0000000000400490 in puts@plt ()
>>>>> 1 0x00000000004005dc in foo ("hello") at cbreak-lib.c:6
>>>>> 2 0x0000000000400604<unmasked> in bar () at cbreak-lib.c:12
>>>>> 3 0x0000000000400620<unmasked> in barbar () at cbreak.c:17
>>>>> 4 0x00000000004005b4 in main () at cbreak-3.c:10
>>>>>
>>>>> The functions in the cbreak-lib use pointer auth, obfuscating the return address
>>>>> to the previous function. The caused the addresses of bar and barbar to require
>>>>> unmasking in order to unwind the backtrace.
>>>>>
>>>>> Alternatively, I considered replacing <unmasked> with a single chracter, such
>>>>> as * for brevity reasons, but felt this would be non obvious for the user.
>>>>
>>>> I don't have a particular suggestion, though my first reaction was that
>>>> it seemed a bit verbose.
>>>>
>>>> IMHO, the marker doesn't have to stand out and be expressive, since users can
>>>> always look at the manual.
>>>
>>> Reading the manual is an assumption Iâm not sure is anywhere near the
>>> common case.
>>> Saying that, I agree we shouldnât be designing the output for the non-readers.
>>>
>>> This comment has reminded me I need to add something to the manual as
>>> part of this
>>> patch.
>>>
>>>
>>>> Once they learn something, often being concise
>>>> helps -- or in other words, once you learn what "<unmasked>" or "U" or whatever
>>>> is, and you're used to it, what would you rather see? What's the main
>>>> information you're looking for when staring at the backtrace? Thoughts
>>>> like that should guide the output too, IMO.
>>>
>>> PAC is the official abbreviation for the feature, so maybe :PAC works best.
>>>
>>> (gdb) bt
>>> 0 0x0000000000400490 in puts@plt ()
>>> 1 0x00000000004005dc in foo ("hello") at cbreak-lib.c:6
>>> 2 0x0000000000400604:PAC in bar () at cbreak-lib.c:12
>>> 3 0x0000000000400620:PAC in barbar () at cbreak.c:17
>>> 4 0x00000000004005b4 in main () at cbreak-3.c:10
>>>
>>>
>>> Some of my attempts at different representations:
>>> 2 0x0000000000400604* in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604! in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604U in bar () at cbreak-lib.c:122
>>> 2 0x0000000000400604:U in bar () at cbreak-lib.c:122
>>> 2 0x0000000000400604<U> in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604[U] in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604<M> in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604<P> in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604<PAC> in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604PAC in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604:PAC in bar () at cbreak-lib.c:12
>>> 2 0x0000000000400604,PAC in bar () at cbreak-lib.c:12
>>>
>>> I found a single character was too hidden. A single character or symbol was also
>>> a little confusing - my brain read U as unsigned, * as pointer, [] as an array.
>>>
>>> I also like ,PAC as it might be easier to add future extensions.
>>
>> It might not be easily doable, but I think it would be nice if you could somehow make it so the function names stay aligned (regardless of which marker you end up choosing), like:
>>
>> 0 0x0000000000400490 in puts@plt ()
>> 1 0x00000000004005dc in foo ("hello") at cbreak-lib.c:6
>> 2 0x0000000000400604 [U] in bar () at cbreak-lib.c:12
>> 3 0x0000000000400620 [U] in barbar () at cbreak.c:17
>> 4 0x00000000004005b4 in main () at cbreak-3.c:10
>
> I almost suggested the same, but didn't when I realized that we
> don't always print the addresses:
>
> (top-gdb) bt
> #0 gdb_main (args=0x7fffffffd3a0) at src/gdb/main.c:1186
> #1 0x0000000000469a7e in main (argc=1, argv=0x7fffffffd4a8) at src/gdb/gdb.c:32
>
Whatâs the reason for that? Surely we always know the address of a function
in the backtrace? Can it happen in the middle of a backtrace?
> But if you do want to align the addresses, you could do that by
> specifying a width for the "addr" column.
> If "[U]" is rare, given no column
> headers, the spaces may look a bit odd, though.
In general, it depends how a binary/library was compiled. But Iâd expect a binary
to either have it in most functions or none.
Should be easy enough to remove the extra spaces if the system doesnât support PAC.
> Maybe you'd want to pre-compute
> the max column width by looking at the max number of frames that fit on a
> page, or something along those lines.
>
hmmm... ok. Iâll see what I can do there.
> On 17 Jul 2019, at 15:43, Pedro Alves <palves@redhat.com> wrote:
<SNIP>
>
> I'd go with either:
>
> 2 0x0000000000400604 (PAC) in bar () at cbreak-lib.c:12
> 2 0x0000000000400604 [PAC] in bar () at cbreak-lib.c:12
>
> Not having the space may make it a little bit harder
> to focus on low digits of the address.
>
>> my brain read U as unsigned, * as pointer, [] as an array.
>
> If you make it like 0x0000000000400604U, then I can see that.
>
> But not so much with:
>
> 2 0x0000000000400604 [U] in bar () at cbreak-lib.c:12
>
> You don't have to use a single letter, though:
>
> 2 0x0000000000400604 [UN] in bar () at cbreak-lib.c:12
>
> [] seems natural as a way to group some flags/properties to me.
>
> We already use it here for example:
>
> (top-gdb) info registers $eflags
> eflags 0x206 [ PF IF ]
>
>
> I guess I'm saying that it depends on context, and I wouldn't
> be worried with [] being confused with C arrays. Afterall,
> < and > also have meaning in C/C++... More than one meaning,
> actually. :-)
>
The extra space really does help there.
Given PAC really is an AArch64 thing (as opposed to something more
generic like Unmasked) might be worth adding a
gdbarch_print_function_address () or something like that so that I
can override it in aarch64. Assuming it fits with all the width
calculations.
Alan.
\x16º&Öéj×!zÊÞ¶êç×ùë¹b²Ö«r\x18\x1dnr\x17¬
next prev parent reply other threads:[~2019-07-17 16:07 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-07-17 8:14 Alan Hayward
2019-07-17 11:15 ` Pedro Alves
2019-07-17 13:36 ` Alan Hayward
2019-07-17 14:44 ` Pedro Alves
2019-07-17 15:02 ` Simon Marchi
2019-07-17 15:18 ` Pedro Alves
2019-07-17 16:07 ` Alan Hayward [this message]
2019-07-17 16:41 ` Pedro Alves
2019-07-17 17:34 ` Alan Hayward
2019-07-18 13:48 ` Tom Tromey
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=8009C474-AE70-4A5B-A2D9-EB3B90626D95@arm.com \
--to=alan.hayward@arm.com \
--cc=gdb-patches@sourceware.org \
--cc=nd@arm.com \
--cc=palves@redhat.com \
--cc=simon.marchi@polymtl.ca \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox