Mirror of the gdb-patches mailing list
 help / color / mirror / Atom feed
From: Doug Evans <dje@google.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: gdb-patches@sourceware.org, tromey@redhat.com, brobecker@adacore.com
Subject: Re: [RFA, doc RFA] Avoid calling gdb_realpath if basenames are different
Date: Fri, 11 Nov 2011 09:00:00 -0000	[thread overview]
Message-ID: <CADPb22ReomSdvRUCJ=h4BD3KMpWgBsdtHDspXgGNr-iHvKh4aw@mail.gmail.com> (raw)
In-Reply-To: <8362irgi5w.fsf@gnu.org>

On Fri, Nov 11, 2011 at 12:47 AM, Eli Zaretskii <eliz@gnu.org> wrote:
>> Date: Thu, 10 Nov 2011 15:58:46 -0800
>> From: Doug Evans <dje@google.com>
>>
>> 2011-11-10  Doug Evans  <dje@google.com>
>>
>>         * NEWS: Mention new parameter basenames-may-differ.
>>         * dwarf2read.c (dw2_lookup_symtab): Avoid calling gdb_realpath if
>>         ! basenames_may_differ.
>>         * psymtab.c (lookup_partial_symtab): Ditto.
>>         * symtab.c (lookup_symtab): Ditto.
>>         (basenames_may_differ): New global.
>>         (_initialize_symtab): New parameter basenames-may-differ.
>>         * symtab.h (basenames_may_differ): Declare.
>>
>>         doc/
>>         * gdb.texinfo (Files): Document basenames-may-differ.
>
> Thanks.
>
>> +set basenames-may-differ
>> +show basenames-may-differ
>> +  Set whether a source file may have multiple base names.
>> +  A "base name" is the name of a file with the directory part removed.
>> +  Example: The base name of "/home/user/hello.c" is "hello.c".
>> +  When doing file name based lookups, gdb will canonicalize file names
>> +  (e.g., expand symlinks) before comparing them, which is an expensive
>> +  operation.
>> +  If set, gdb will not assume a file is known by one base name, and thus
>> +  it cannot optimize file name comparisions by skipping the canonicalization
>> +  step if the base names are different.
>> +  If not set, all source files must be known by one base name,
>> +  and gdb will do file name comparisons more efficiently.
>
> I suggest to rearrange the text, so as to put together the parts that
> describe what happens when the option is set.  Like this:
>
>  Set whether a source file may have multiple base names.
>  (A "base name" is the name of a file with the directory part removed.
>  Example: The base name of "/home/user/hello.c" is "hello.c".)
>  If set, GDB will canonicalize file names (e.g., expand symlinks)
>  before comparing them.  Canonicalization is an expensive operation,
>  but it allows the same file be known by more than one base name.
>  If not set (the default), all source files are assumed to have just
>  one base name, and gdb will do file name comparisons more efficiently.
>
> OK?
>
>> +When processing file names provided by the user,
>> +@value{GDBN} will canonicalize them and remove symbolic links.
>> +This ensures that @value{GDBN} will find the right file,
>> +even if the debug information specifies an alternate path.
>> +However, with large programs this canonicalization can noticeably slow
>> +down @value{GDBN}.  To compensate, @value{GDBN} will try to avoid
>> +this canonicalization wherever possible.  One way it can do so
>> +is by first comparing the @samp{base name} of a file.
>> +The @samp{base name} of a file is simply the file's name without
>> +any directory information.  For example, the base name of
>> +@file{/home/user/hello.c} is @file{hello.c}.
>> +By doing this @value{GDBN} can skip, for example,
>> +@file{/usr/include/stdio.h} without having to first canonicalize
>> +and then compare the directory names.
>> +This works great, except when the base name of a file
>> +can have multiple names due to symbolic links.
>> +For example, if @file{/home/user/bar.c} is a symbolic link to
>> +@file{/home/user/foo.c} then @value{GDBN} cannot just look at
>> +the base name of two files, it must canonicalize them, expand
>> +all symbolic links, and @emph{then} compare the file names
>> +to see if they match.
>> +Fortunately, having one file known by two different base names
>> +does not generally occur in practice.
>> +Should it occur, however, @value{GDBN} provides an escape hatch
>> +to allow this to work.
>> +By setting @code{basenames-may-differ} to @code{true}
>> +@value{GDBN} will always canonicalize file names before
>> +comparing them, thus ensuring that one file known by multiple
>> +base names are treated as the same file.
>
> This is written as mostly an apology for having this option.  That is
> a wrong angle for describing features in a user manual, because the
> user generally trusts the developers by default to DTRT.  So I would
> reword it
>
>  When processing file names provided by the user, @value{GDBN}
>  frequently needs to compare them to the file names recorded in the
>  program's debug info.  Normally, @value{GDBN} compares just the
>  @dfn{base names} of the files as strings, which is reasonably fast
>  even for very large programs.  (The base name of a file is the last
>  portion of its name, after stripping all the leading directories.)
>  This shortcut in comparison is based upon the assumption that files
>  cannot have more than one base name.  This is usually true, but
>  references to files that use symlinks or similar filesystem
>  facilities violate that assumption.  If your program records files
>  using such facilities, or if you provide file names to @value{GDBN}
>  using symlinks etc., you can set @code{basenames-may-differ} to
>  @code{true} to instruct @value{GDBN} to completely canonicalize each
>  pair of file names it needs to compare.  This will make file-name
>  comparisons accurate, but at a price of a significant slowdown.
>
> Do you agree with this wording?
>

I'm happy if you're happy.
Thanks for the suggested wording.


  reply	other threads:[~2011-11-11  9:00 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-11-06  6:31 [RFC] " Doug Evans
2011-11-06  9:07 ` asmwarrior
2011-11-07 17:06 ` Joel Brobecker
2011-11-08 17:18 ` Tom Tromey
2011-11-11  0:57 ` [RFA, doc RFA] " Doug Evans
2011-11-11  8:49   ` Eli Zaretskii
2011-11-11  9:00     ` Doug Evans [this message]
2011-11-15  4:46       ` Doug Evans
2011-11-15  6:00         ` Eli Zaretskii
2011-11-15 14:23         ` Joel Brobecker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CADPb22ReomSdvRUCJ=h4BD3KMpWgBsdtHDspXgGNr-iHvKh4aw@mail.gmail.com' \
    --to=dje@google.com \
    --cc=brobecker@adacore.com \
    --cc=eliz@gnu.org \
    --cc=gdb-patches@sourceware.org \
    --cc=tromey@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox