From: Simon Marchi <simark@simark.ca>
To: Tom Tromey <tom@tromey.com>, gdb-patches@sourceware.org
Subject: Re: [PATCH v2 11/28] Rewrite the .gdb_index reader
Date: Wed, 23 Apr 2025 13:22:37 -0400
Message-ID: <6785bbf8-4a37-454a-abcb-747f27b8e153@simark.ca>
In-Reply-To: <20250402-search-in-psyms-v2-11-ea91704487cb@tromey.com>
On 2025-04-02 19:45, Tom Tromey wrote:
> This patch rewrites the .gdb_index reader to create the same data
> structures that are created by the cooked indexer and the .debug_names
> reader.
>
> This is done in support of this series; but also because, from what I
> can tell, the "templates.exp" change didn't really work properly with
> this reader.
>
> In addition to fixing that problem, this patch removes a lot of code.
>
> Implementing this required a couple of hacks, as .gdb_index does not
> contain all the information that's used by the cooked index
> implementation.
>
> * The index-searching code likes to differentiate between the various
> DWARF tags when matching, but .gdb_index lumps many things into a
> single "other" category. To handle this, we introduce a phony tag
> that's used so that the match method can match on multiple domains.
>
> * Similarly, .gdb_index doesn't distinguish between the type and
> struct domains, so another phony tag is used for this.
>
> * Support for older versions of .gdb_index is removed entirely.
>
> * The reader must attempt to guess the language of various symbols.
> This is somewhat finicky. "Plain" (unqualified) symbols are marked
> as language_unknown and then a couple of hacks are used to handle
> these -- one in expand_symtabs_matching and another when recognizing
> "main".
>
> For what it's worth, I consider .gdb_index to be near the end of its
> life. While .debug_names is not perfect -- we found a number of bugs
> in the standard while implementing it -- it is better than .gdb_index
> and also better documented.
>
> After this patch, we could conceivably remove dwarf_scanner_base.
> However, I have not done this.
>
> Finally, this patch also changes this reader to dump the content of
> the index, as the other DWARF readers do. This can be handy when
> debugging gdb.

I don't know the .gdb_index quirks well enough to have an informed
opinion about this.  I am happy with this change because it brings the
.gdb_index reader in line with the other two readers we already have
(.debug_info and .debug_names).

I noted a few miscellaneous comments below.

> diff --git a/gdb/dwarf2/cooked-index-shard.c b/gdb/dwarf2/cooked-index-shard.c
> index 29a8aea513786e4c1c1ed77dee8610fc329d1c8a..888a0fa345b9124e2814aa722e71a9d2fd5adf8e 100644
> --- a/gdb/dwarf2/cooked-index-shard.c
> +++ b/gdb/dwarf2/cooked-index-shard.c
> @@ -86,7 +86,16 @@ cooked_index_shard::add (sect_offset die_offset, enum dwarf_tag tag,
> implicit "main" discovery. */
> if ((flags & IS_MAIN) != 0)
> m_main = result;
> - else if ((flags & IS_PARENT_DEFERRED) == 0
> + /* The language check here is subtle: it exists solely to work
> + around a bug in .gdb_index. That index does not record
> + languages, but it might emit an entry for "main". However,
> + recognizing this "main" as being the main program would be wrong
> + -- for example, an Ada program has a C "main" but this is not the
> + desired target of the "start" command. Requiring the language to
> + be set here avoids over-eagerly setting the "main" when using
> + .gdb_index. Should .gdb_index ever be removed (PR symtab/31363),
> + the language_unknown check here could also be removed. */
> + else if (lang != language_unknown
Did you remove the IS_PARENT_DEFERRED check on purpose?
> @@ -203,10 +209,10 @@ enum class cooked_state
> /* An object of this type controls the scanning of the DWARF. It
> schedules the worker tasks and tracks the current state. Once
> scanning is done, this object is discarded.
> -
> +
Trailing spaces ^.
> +/* This is like a cooked index, but as it has been ingested from
> + .gdb_index, it can't be used to write out an index. */
> +
> +class cooked_gdb_index : public cooked_index

Technically, I think this should read "This is a cooked index as
ingested from .gdb_index", to reflect the "is-a" relationship.
Currently, the base class handles the "from .debug_info" case, and
derived classes handle the indexes.  I think we should (eventually)
move to an abstract base class, with a separate derived class for the
"from .debug_info" case.

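To sketch the shape I have in mind (all names here are hypothetical and
do not exist in gdb; this is only an illustration of the hierarchy, not
a proposal for the actual interface):

```cpp
#include <string>

/* Hypothetical abstract base: it says nothing about where the
   entries came from.  */
class index_base
{
public:
  virtual ~index_base () = default;

  /* Name of the section the entries were read from.  */
  virtual std::string source () const = 0;

  /* True if this index can be used to write out an index.  */
  virtual bool can_write_index () const = 0;
};

/* Hypothetical derived class for the "from .debug_info" case, which
   today is what the base class itself represents.  */
class index_from_debug_info : public index_base
{
public:
  std::string source () const override { return ".debug_info"; }
  bool can_write_index () const override { return true; }
};

/* Hypothetical derived class for the "from .gdb_index" case.  Per the
   comment in the patch, it can't be used to write out an index.  */
class index_from_gdb_index : public index_base
{
public:
  std::string source () const override { return ".gdb_index"; }
  bool can_write_index () const override { return false; }
};

/* Callers only see the abstract interface.  */
std::string
describe (const index_base &idx)
{
  return idx.source () + (idx.can_write_index ()
			  ? " (writable)" : " (read-only)");
}
```

With this layout, each derived class "is a" cooked index in its own
right, and the base class no longer doubles as the .debug_info case.
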
> + /* Note that this assumes the final component ends in \0. */
> + cooked_index_entry *entry = result.add (per_cu->sect_off, tag,
> + flags, this_lang,
> + components.back ().data (),
> + nullptr, per_cu);
> + /* Don't bother pushing if we do not need a panrent. */
panrent -> parent
Simon