From: Doug Evans <dje@google.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: gdb-patches@sourceware.org, tromey@redhat.com, brobecker@adacore.com
Subject: Re: [RFA, doc RFA] Avoid calling gdb_realpath if basenames are different
Date: Fri, 11 Nov 2011 09:00:00 -0000 [thread overview]
Message-ID: <CADPb22ReomSdvRUCJ=h4BD3KMpWgBsdtHDspXgGNr-iHvKh4aw@mail.gmail.com> (raw)
In-Reply-To: <8362irgi5w.fsf@gnu.org>
On Fri, Nov 11, 2011 at 12:47 AM, Eli Zaretskii <eliz@gnu.org> wrote:
>> Date: Thu, 10 Nov 2011 15:58:46 -0800
>> From: Doug Evans <dje@google.com>
>>
>> 2011-11-10 Doug Evans <dje@google.com>
>>
>> * NEWS: Mention new parameter basenames-may-differ.
>> * dwarf2read.c (dw2_lookup_symtab): Avoid calling gdb_realpath if
>> ! basenames_may_differ.
>> * psymtab.c (lookup_partial_symtab): Ditto.
>> * symtab.c (lookup_symtab): Ditto.
>> (basenames_may_differ): New global.
>> (_initialize_symtab): New parameter basenames-may-differ.
>> * symtab.h (basenames_may_differ): Declare.
>>
>> doc/
>> * gdb.texinfo (Files): Document basenames-may-differ.
>
> Thanks.
>
>> +set basenames-may-differ
>> +show basenames-may-differ
>> + Set whether a source file may have multiple base names.
>> + A "base name" is the name of a file with the directory part removed.
>> + Example: The base name of "/home/user/hello.c" is "hello.c".
>> + When doing file name based lookups, gdb will canonicalize file names
>> + (e.g., expand symlinks) before comparing them, which is an expensive
>> + operation.
>> + If set, gdb will not assume a file is known by one base name, and thus
>> + it cannot optimize file name comparisions by skipping the canonicalization
>> + step if the base names are different.
>> + If not set, all source files must be known by one base name,
>> + and gdb will do file name comparisons more efficiently.
>
> I suggest to rearrange the text, so as to put together the parts that
> describe what happens when the option is set. Like this:
>
> Set whether a source file may have multiple base names.
> (A "base name" is the name of a file with the directory part removed.
> Example: The base name of "/home/user/hello.c" is "hello.c".)
> If set, GDB will canonicalize file names (e.g., expand symlinks)
> before comparing them. Canonicalization is an expensive operation,
> but it allows the same file be known by more than one base name.
> If not set (the default), all source files are assumed to have just
> one base name, and gdb will do file name comparisons more efficiently.
>
> OK?
>
>> +When processing file names provided by the user,
>> +@value{GDBN} will canonicalize them and remove symbolic links.
>> +This ensures that @value{GDBN} will find the right file,
>> +even if the debug information specifies an alternate path.
>> +However, with large programs this canonicalization can noticeably slow
>> +down @value{GDBN}. To compensate, @value{GDBN} will try to avoid
>> +this canonicalization wherever possible. One way it can do so
>> +is by first comparing the @samp{base name} of a file.
>> +The @samp{base name} of a file is simply the file's name without
>> +any directory information. For example, the base name of
>> +@file{/home/user/hello.c} is @file{hello.c}.
>> +By doing this @value{GDBN} can skip, for example,
>> +@file{/usr/include/stdio.h} without having to first canonicalize
>> +and then compare the directory names.
>> +This works great, except when the base name of a file
>> +can have multiple names due to symbolic links.
>> +For example, if @file{/home/user/bar.c} is a symbolic link to
>> +@file{/home/user/foo.c} then @value{GDBN} cannot just look at
>> +the base name of two files, it must canonicalize them, expand
>> +all symbolic links, and @emph{then} compare the file names
>> +to see if they match.
>> +Fortunately, having one file known by two different base names
>> +does not generally occur in practice.
>> +Should it occur, however, @value{GDBN} provides an escape hatch
>> +to allow this to work.
>> +By setting @code{basenames-may-differ} to @code{true}
>> +@value{GDBN} will always canonicalize file names before
>> +comparing them, thus ensuring that one file known by multiple
>> +base names are treated as the same file.
>
> This is written as mostly an apology for having this option. That is
> a wrong angle for describing features in a user manual, because the
> user generally trusts the developers by default to DTRT. So I would
> reword it
>
> When processing file names provided by the user, @value{GDBN}
> frequently needs to compare them to the file names recorded in the
> program's debug info. Normally, @value{GDBN} compares just the
> @dfn{base names} of the files as strings, which is reasonably fast
> even for very large programs. (The base name of a file is the last
> portion of its name, after stripping all the leading directories.)
> This shortcut in comparison is based upon the assumption that files
> cannot have more than one base name. This is usually true, but
> references to files that use symlinks or similar filesystem
> facilities violate that assumption. If your program records files
> using such facilities, or if you provide file names to @value{GDBN}
> using symlinks etc., you can set @code{basenames-may-differ} to
> @code{true} to instruct @value{GDBN} to completely canonicalize each
> pair of file names it needs to compare. This will make file-name
> comparisons accurate, but at a price of a significant slowdown.
>
> Do you agree with this wording?
>
I'm happy if you're happy.
Thanks for the suggested wording.
next prev parent reply other threads:[~2011-11-11 9:00 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-11-06 6:31 [RFC] " Doug Evans
2011-11-06 9:07 ` asmwarrior
2011-11-07 17:06 ` Joel Brobecker
2011-11-08 17:18 ` Tom Tromey
2011-11-11 0:57 ` [RFA, doc RFA] " Doug Evans
2011-11-11 8:49 ` Eli Zaretskii
2011-11-11 9:00 ` Doug Evans [this message]
2011-11-15 4:46 ` Doug Evans
2011-11-15 6:00 ` Eli Zaretskii
2011-11-15 14:23 ` Joel Brobecker
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CADPb22ReomSdvRUCJ=h4BD3KMpWgBsdtHDspXgGNr-iHvKh4aw@mail.gmail.com' \
--to=dje@google.com \
--cc=brobecker@adacore.com \
--cc=eliz@gnu.org \
--cc=gdb-patches@sourceware.org \
--cc=tromey@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox