Re: [PATCH v3] Teach GDB to generate sparse core files (PR corefiles/31494)

Mirror of the gdb-patches mailing list
 help / color / mirror / Atom feed

From: Lancelot SIX <Lancelot.Six@amd.com>
To: Pedro Alves <pedro@palves.net>, gdb-patches@sourceware.org
Subject: Re: [PATCH v3] Teach GDB to generate sparse core files (PR corefiles/31494)
Date: Thu, 21 Mar 2024 21:27:19 +0000	[thread overview]
Message-ID: <5f504bb2-75b2-43fb-b74f-708cf0c46bb6@amd.com> (raw)
In-Reply-To: <269ff31a-9aeb-4293-a4d9-df0f16f12e88@palves.net>

Hi,

I have a couple of small comments below.

On 18/03/2024 17:43, Pedro Alves wrote:
> diff --git a/gdb/gcore.c b/gdb/gcore.c
> @@ -567,6 +583,158 @@ objfile_find_memory_regions (struct target_ops *self,
>     return 0;
>   }
> 
> +/* Check if we have a block full of zeros at DATA within the [DATA,
> +   DATA+SIZE) buffer.  Returns the size of the all-zero block found.
> +   Returns at most the minimum between SIZE and SPARSE_BLOCK_SIZE.  */
> +
> +static size_t
> +get_all_zero_block_size (const gdb_byte *data, size_t size)
> +{
> +  size = std::min (size, (size_t) SPARSE_BLOCK_SIZE);
> +
> +  /* A memcmp of a whole block is much faster than a simple for loop.
> +     This makes a big difference, as with a for loop, this code would
> +     dominate the performance and result in doubling the time to
> +     generate a core, at the time of writing.  With an optimized
> +     memcmp, this doesn't even show up in the perf trace.  */
> +  static const gdb_byte all_zero_block[SPARSE_BLOCK_SIZE] = {};
> +  if (memcmp (data, all_zero_block, size) == 0)
> +    return size;
> +  return 0;
> +}
> +
> +/* Basically a named-elements pair, used as return type of
> +   find_next_all_zero_block.  */
> +
> +struct offset_and_size
> +{
> +  size_t offset;
> +  size_t size;
> +};
> +
> +/* Find the next all-zero block at DATA+OFFSET within the [DATA,
> +   DATA+SIZE) buffer.  Returns the offset and the size of the all-zero
> +   block if found, or zero if not found.  */
> +
> +static offset_and_size
> +find_next_all_zero_block (const gdb_byte *data, size_t offset, size_t size)
> +{
> +  for (; offset < size; offset += SPARSE_BLOCK_SIZE)
> +    {
> +      size_t zero_block_size
> +       = get_all_zero_block_size (data + offset, size - offset);
> +      if (zero_block_size != 0)
> +       return {offset, zero_block_size};
> +    }
> +  return {0, 0};
> +}
> +
> +/* Wrapper around bfd_set_section_contents that avoids writing
> +   all-zero blocks to disk, so we create a sparse core file.
> +   SKIP_ALIGN is a recursion helper -- if true, we'll skip aligning
> +   the file position to SPARSE_BLOCK_SIZE.  */
> +
> +static bool
> +sparse_bfd_set_section_contents (bfd *obfd, asection *osec,
> +                                const gdb_byte *data,
> +                                size_t sec_offset,
> +                                size_t size,
> +                                bool skip_align = false)
> +{
> +  /* Note, we don't have to have special handling for the case of the
> +     last memory region ending with zeros, because our caller always
> +     writes out the note section after the memory/load sections.  If
> +     it didn't, we'd have to seek+write the last byte to make the file
> +     size correct.  (Or add an ftruncate abstraction to bfd and call
> +     that.)  */
> +
> +  if (!skip_align)
> +    {
> +      /* Align the all-zero block search with SPARSE_BLOCK_SIZE, to
> +        better align with filesystem blocks.  If we find we're
> +        misaligned, then write/skip the bytes needed to make us
> +        aligned.  We do that with (one level) recursion.  */
> +
> +      /* We need to know the section's file offset on disk.  We can
> +        only look at it after the bfd's 'output_has_begun' flag has
> +        been set, as bfd hasn't computed the file offsets
> +        otherwise.  */
> +      if (!obfd->output_has_begun)
> +       {
> +         gdb_byte dummy = 0;
> +
> +         /* A write forces BFD to compute the bfd's section file
> +            positions.  Zero size works for that too.  */
> +         if (!bfd_set_section_contents (obfd, osec, &dummy, 0, 0))
> +           return false;
> +
> +         gdb_assert (obfd->output_has_begun);
> +       }
> +
> +      /* How much we need to write/skip in order to find the next
> +        SPARSE_BLOCK_SIZE filepos-aligned block.  */
> +      size_t align_remainder
> +       = (SPARSE_BLOCK_SIZE
> +          - (osec->filepos + sec_offset) % SPARSE_BLOCK_SIZE);
> +
> +      /* How much we'll actually write in the recursion call.  */
> +      size_t align_write_size = std::min (size, align_remainder);
> +
> +      if (align_write_size != 0)

I think at this point align_write_size can be SPARSE_BLOCK_SIZE (i.e. 
sec_offset lands at a SPARSE_BLOCK_SIZE boundary in the underlying 
filesystem).  If that's the case, and data+sec_offset starts with block 
of 0s, you'll write it to disk needlessly.  Not a big deal, but I'd go for:

     if (align_write_size % SPARSE_BLOCK_SIZE != 0)

> +       {
> +         /* Recurse, skipping the alignment code.  */
> +         if (!sparse_bfd_set_section_contents (obfd, osec, data,
> +                                               sec_offset,
> +                                               align_write_size, true))
> +           return false;
> +
> +         /* Skip over what we've written, and proceed with
> +            assumes-aligned logic.  */
> +         data += align_write_size;
> +         sec_offset += align_write_size;
> +         size -= align_write_size;
> +       }
> +    }
> +
> +  size_t data_offset = 0;

Just because that got me to think while reading, having the first part 
of the procedure update data/sec_offset/size and the second part of the 
procedure update data_offset seems a bit inconsistent.

I would probably move the declaration of data_offset at the very 
begining of the procedure update it consistently:

     size_t data_offset = 0;
     if (!skip_align)
       {
         […]
         if (align_write_size % SPARSE_BLOCK_SIZE != 0)
           {
             […]
             data_offset += align_write_size;
           }
        }
      while (data_offset < size)
        […]

> +  while (data_offset < size)
> +    {
> +      size_t all_zero_block_size
> +       = get_all_zero_block_size (data + data_offset, size - data_offset);
> +      if (all_zero_block_size != 0)
> +       data_offset += all_zero_block_size;
> +      else
> +       {
> +         /* We have some non-zero data to write to file.  Find the
> +            next all-zero block within the data, and only write up to
> +            it.  */
> +
> +         offset_and_size next_all_zero_block
> +           = find_next_all_zero_block (data,
> +                                       data_offset + SPARSE_BLOCK_SIZE,
> +                                       size);
> +         size_t next_data_offset = (next_all_zero_block.offset == 0
> +                                    ? size
> +                                    : next_all_zero_block.offset);
> +
> +         if (!bfd_set_section_contents (obfd, osec, data + data_offset,
> +                                        sec_offset + data_offset,
> +                                        next_data_offset - data_offset))
> +           return false;
> +
> +         data_offset = next_data_offset;
> +
> +         /* If we already know we have an all-zero block at the next
> +            offset, we can skip calling get_all_zero_block_size for
> +            it again.  */
> +         if (next_all_zero_block.offset != 0)
> +           data_offset += next_all_zero_block.size;
> +       }
> +    }
> +
> +  return true;
> +}
> +
>   static void
>   gcore_copy_callback (bfd *obfd, asection *osec)
>   {
> @@ -599,8 +767,9 @@ gcore_copy_callback (bfd *obfd, asection *osec)
>                               bfd_section_vma (osec)));
>            break;
>          }
> -      if (!bfd_set_section_contents (obfd, osec, memhunk.data (),
> -                                    offset, size))
> +
> +      if (!sparse_bfd_set_section_contents (obfd, osec, memhunk.data (),
> +                                           offset, size))
>          {
>            warning (_("Failed to write corefile contents (%s)."),
>                     bfd_errmsg (bfd_get_error ()));
Best,
Lancelot.

next prev parent reply	other threads:[~2024-03-21 21:28 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-03-15 18:27 [PATCH] " Pedro Alves
2024-03-15 18:40 ` Eli Zaretskii
2024-03-18 13:29 ` [PATCH v2] " Pedro Alves
2024-03-18 17:43   ` [PATCH v3] " Pedro Alves
2024-03-19 16:41     ` John Baldwin
2024-03-21 21:27     ` Lancelot SIX [this message]
2024-03-21 23:14       ` [PATCH v4] " Pedro Alves
2024-03-22 10:26         ` Lancelot SIX
2024-03-22 12:33           ` Pedro Alves

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5f504bb2-75b2-43fb-b74f-708cf0c46bb6@amd.com \
    --to=lancelot.six@amd.com \
    --cc=gdb-patches@sourceware.org \
    --cc=pedro@palves.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox