From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from simark.ca by simark.ca with LMTP id NkCkGBZ5lGl96z4AWB0awg (envelope-from ) for ; Tue, 17 Feb 2026 09:20:06 -0500 Authentication-Results: simark.ca; dkim=pass (1024-bit key; unprotected) header.d=suse.de header.i=@suse.de header.a=rsa-sha256 header.s=susede2_rsa header.b=iTqR4RQ9; dkim=pass header.d=suse.de header.i=@suse.de header.a=ed25519-sha256 header.s=susede2_ed25519 header.b=WfUQ0QJl; dkim=pass (1024-bit key) header.d=suse.de header.i=@suse.de header.a=rsa-sha256 header.s=susede2_rsa header.b=iTqR4RQ9; dkim=neutral header.d=suse.de header.i=@suse.de header.a=ed25519-sha256 header.s=susede2_ed25519 header.b=WfUQ0QJl; dkim-atps=neutral Received: by simark.ca (Postfix, from userid 112) id 48E8F1E089; Tue, 17 Feb 2026 09:20:06 -0500 (EST) X-Spam-Checker-Version: SpamAssassin 4.0.1 (2024-03-25) on simark.ca X-Spam-Level: X-Spam-Status: No, score=-2.4 required=5.0 tests=ARC_SIGNED,ARC_VALID,BAYES_00, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,MAILING_LIST_MULTI, RCVD_IN_DNSWL_MED,RCVD_IN_VALIDITY_CERTIFIED_BLOCKED, RCVD_IN_VALIDITY_RPBL_BLOCKED,RCVD_IN_VALIDITY_SAFE_BLOCKED autolearn=ham autolearn_force=no version=4.0.1 Received: from vm01.sourceware.org (vm01.sourceware.org [38.145.34.32]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange x25519 server-signature ECDSA (prime256v1) server-digest SHA256) (No client certificate requested) by simark.ca (Postfix) with ESMTPS id 4F2771E089 for ; Tue, 17 Feb 2026 09:20:04 -0500 (EST) Received: from vm01.sourceware.org (localhost [127.0.0.1]) by sourceware.org (Postfix) with ESMTP id 273D44B9DB78 for ; Tue, 17 Feb 2026 14:20:03 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 273D44B9DB78 Authentication-Results: sourceware.org; dkim=pass (1024-bit key, unprotected) header.d=suse.de header.i=@suse.de header.a=rsa-sha256 header.s=susede2_rsa header.b=iTqR4RQ9; dkim=pass header.d=suse.de header.i=@suse.de header.a=ed25519-sha256 header.s=susede2_ed25519 header.b=WfUQ0QJl; dkim=pass (1024-bit key) header.d=suse.de header.i=@suse.de header.a=rsa-sha256 header.s=susede2_rsa header.b=iTqR4RQ9; dkim=neutral header.d=suse.de header.i=@suse.de header.a=ed25519-sha256 header.s=susede2_ed25519 header.b=WfUQ0QJl Received: from smtp-out1.suse.de (smtp-out1.suse.de [IPv6:2a07:de40:b251:101:10:150:64:1]) by sourceware.org (Postfix) with ESMTPS id 6F0BC4BA23C3 for ; Tue, 17 Feb 2026 14:19:33 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 6F0BC4BA23C3 Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=suse.de Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=suse.de ARC-Filter: OpenARC Filter v1.0.0 sourceware.org 6F0BC4BA23C3 Authentication-Results: server2.sourceware.org; arc=none smtp.remote-ip=2a07:de40:b251:101:10:150:64:1 ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1771337973; cv=none; b=IC4Wv+93p+0XMYA0UQAJrVNi3nm6ZfBEyd058WOxgTFQt+VcqLN50uXruuR8Dg7eGjb5PlXsEjVs9Rv5GuPGzl6CmhKCiczbY8DMwWn9H4Z9GURuB8NHGaUzD77eXw0GFWNB2PItLCbofgDl+pDBDmw+swPo4fB0y6mCB8ViAL0= ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1771337973; c=relaxed/simple; bh=d94QL989e7JyXmRi1BNYsQc1HIHO0VGQQHMKxbhBV74=; h=DKIM-Signature:DKIM-Signature:DKIM-Signature:DKIM-Signature: Message-ID:Date:MIME-Version:Subject:To:From; b=ppmIDvhYWDYK2Hilc2u3+PJuqQ7a8QT5rxRlDrxuCuZGa8KAY2DHc4W5+ZpE/i8B2Amv76yGfbAeJ6FME69oKaXr02VWtwe6CFpWb1SlxZo4DRGQKsEeHKquVgWAEQj5qHXxnnPlRoh/Sv3yjc7qQyWREH54hHxYPPey2FYsFeY= ARC-Authentication-Results: i=1; server2.sourceware.org DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 6F0BC4BA23C3 Received: from imap1.dmz-prg2.suse.org (imap1.dmz-prg2.suse.org [IPv6:2a07:de40:b281:104:10:150:64:97]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id 4F8173E6EF; Tue, 17 Feb 2026 14:19:32 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_rsa; t=1771337972; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=ZfdFxC9phYk3YfgirpeWb6Zi1OBvi5tcA0UE9j7PCuw=; b=iTqR4RQ9SdKQnBiYDamJhpNvpgS8yqhm87EPuWpHsqHP+/OdDgNWhYP7IbuNjh1QoAiAp5 LQcE6KG1Ixg5fQJLxp5UUetb4vMQSII8JOBXZM3USOyhSkEPp6Y88oHkjUFMIembZx0KmE pUzEJLDf0Hsb1zaiq04dJjfCTFjBOlM= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_ed25519; t=1771337972; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=ZfdFxC9phYk3YfgirpeWb6Zi1OBvi5tcA0UE9j7PCuw=; b=WfUQ0QJllgofh+Cp8Mu1q5sR+CR8jNBVDJFuxjp+OSUUe59mPTTK3tcvqHmI3QdzgWRSeh sPNWEuBYYza1voAg== Authentication-Results: smtp-out1.suse.de; dkim=pass header.d=suse.de header.s=susede2_rsa header.b=iTqR4RQ9; dkim=pass header.d=suse.de header.s=susede2_ed25519 header.b=WfUQ0QJl DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_rsa; t=1771337972; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=ZfdFxC9phYk3YfgirpeWb6Zi1OBvi5tcA0UE9j7PCuw=; b=iTqR4RQ9SdKQnBiYDamJhpNvpgS8yqhm87EPuWpHsqHP+/OdDgNWhYP7IbuNjh1QoAiAp5 LQcE6KG1Ixg5fQJLxp5UUetb4vMQSII8JOBXZM3USOyhSkEPp6Y88oHkjUFMIembZx0KmE pUzEJLDf0Hsb1zaiq04dJjfCTFjBOlM= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_ed25519; t=1771337972; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=ZfdFxC9phYk3YfgirpeWb6Zi1OBvi5tcA0UE9j7PCuw=; b=WfUQ0QJllgofh+Cp8Mu1q5sR+CR8jNBVDJFuxjp+OSUUe59mPTTK3tcvqHmI3QdzgWRSeh sPNWEuBYYza1voAg== Received: from imap1.dmz-prg2.suse.org (localhost [127.0.0.1]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by imap1.dmz-prg2.suse.org (Postfix) with ESMTPS id 2DD063EA65; Tue, 17 Feb 2026 14:19:32 +0000 (UTC) Received: from dovecot-director2.suse.de ([2a07:de40:b281:106:10:150:64:167]) by imap1.dmz-prg2.suse.org with ESMTPSA id 0mPNCfR4lGlMOgAAD6G6ig (envelope-from ); Tue, 17 Feb 2026 14:19:32 +0000 Message-ID: Date: Tue, 17 Feb 2026 15:19:31 +0100 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v2] gcore: Handle unreadable pages within readable memory regions To: Kevin Buettner , gdb-patches@sourceware.org References: <20260212194039.1717054-1-kevinb@redhat.com> Content-Language: en-US From: Tom de Vries In-Reply-To: <20260212194039.1717054-1-kevinb@redhat.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spamd-Result: default: False [-4.51 / 50.00]; BAYES_HAM(-3.00)[100.00%]; NEURAL_HAM_LONG(-1.00)[-1.000]; R_DKIM_ALLOW(-0.20)[suse.de:s=susede2_rsa,suse.de:s=susede2_ed25519]; NEURAL_HAM_SHORT(-0.20)[-1.000]; MIME_GOOD(-0.10)[text/plain]; MX_GOOD(-0.01)[]; FUZZY_RATELIMITED(0.00)[rspamd.com]; RCVD_VIA_SMTP_AUTH(0.00)[]; ARC_NA(0.00)[]; RBL_SPAMHAUS_BLOCKED_OPENRESOLVER(0.00)[2a07:de40:b281:104:10:150:64:97:from]; RECEIVED_SPAMHAUS_BLOCKED_OPENRESOLVER(0.00)[2a07:de40:b281:106:10:150:64:167:received]; TO_DN_SOME(0.00)[]; MIME_TRACE(0.00)[0:+]; RCVD_TLS_ALL(0.00)[]; MID_RHS_MATCH_FROM(0.00)[]; RCPT_COUNT_TWO(0.00)[2]; RCVD_COUNT_TWO(0.00)[2]; FROM_EQ_ENVFROM(0.00)[]; FROM_HAS_DN(0.00)[]; SPAMHAUS_XBL(0.00)[2a07:de40:b281:104:10:150:64:97:from]; DBL_BLOCKED_OPENRESOLVER(0.00)[imap1.dmz-prg2.suse.org:rdns,imap1.dmz-prg2.suse.org:helo,suse.de:dkim,suse.de:mid,suse.de:email]; DNSWL_BLOCKED(0.00)[2a07:de40:b281:104:10:150:64:97:from,2a07:de40:b281:106:10:150:64:167:received]; TO_MATCH_ENVRCPT_ALL(0.00)[]; DKIM_SIGNED(0.00)[suse.de:s=susede2_rsa,suse.de:s=susede2_ed25519]; DKIM_TRACE(0.00)[suse.de:+] X-Rspamd-Action: no action X-Rspamd-Queue-Id: 4F8173E6EF X-Rspamd-Server: rspamd1.dmz-prg2.suse.org X-BeenThere: gdb-patches@sourceware.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: Gdb-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: gdb-patches-bounces~public-inbox=simark.ca@sourceware.org On 2/12/26 8:39 PM, Kevin Buettner wrote: > GLIBC 2.42 changed how thread stack guard pages are implemented [2]. > In GLIBC 2.41 and earlier, guard pages were set up using mprotect() to > mark guard regions with no permissions. Once configured, guard pages > were visible as separate entries in /proc/PID/maps with no permissions > (i.e. they're inaccessible). In GLIBC 2.42, guard pages are > installed using the kernel's MADV_GUARD_INSTALL mechanism [1], which > marks them at the page table entry (PTE) level within the existing > mapping. > > As a consequence, guard pages do not appear as separate entries in > /proc/PID/maps, but remain as part of the containing mapping. Moreover, > thread stacks from multiple mmap() calls may be merged into a single > virtual memory area (VMA) with read and write permissions since there's > no guard page VMA to separate them. These guard pages cannot be > distinguished by examining VMA listings but do return EIO when read > from /proc/PID/mem. > > GDB's gcore code reads /proc/PID/smaps to discover memory regions and > creates one BFD section per mapping. (On linux, this is performed in > linux_find_memory_regions_full in linux-tdep.c.) With the old layout, > memory areas with guard pages appeared separately with no permissions, > which were filtered out. Each thread stack became its own section > containing only readable data. With the new layout, using > MADV_GUARD_INSTALL instead of the older mechanism, it's often the case > that thread stacks created with multiple calls to mmap() are exposed > as a single mapping appearing in /proc/PID/smaps with read and write > permissions. Should that happen, GDB's code creates a single section > covering all thread stacks and their guard pages. (Even if each > thread stack appears in its own mapping, the fact remains that there > will be an inaccessible portion of the mapping. When one or more > thread stacks are coalesced into a single mapping, there will be > several inaccessible "holes" representing the guard pages.) > > When gcore_copy_callback copies section contents, it reads memory in > 1MB (MAX_COPY_BYTES) chunks. If any page in the chunk is a guard page, > the call to target_read_memory() fails. The old code responded by > breaking out of the copy loop, abandoning the entire section. This > prevents correct copying of thread stack data, resulting in core files > with zero-filled thread stacks, resulting in nearly empty backtraces. > > Fix this by falling back to page-by-page reading when a 1MB chunk read > fails. Individual pages that cannot be read are filled with zeros, > allowing the remaining readable memory to be captured. > > I also considered a simpler change using the value of > FALLBACK_PAGE_SIZE (4096) as the read size instead of MAX_COPY_BYTES > (1MB). This would avoid the fallback logic but would cause up to 256x > more syscalls. The proposed approach also allows meaningful warnings: > we warn only if an entire region is unreadable (indicating a real > problem), whereas per-page reads would make it harder to distinguish > guard page failures from actual errors. Since guard pages are at > offset 0 for downward-growing stacks, a large target_read_memory() > fails early at the first unreadable byte anyway. > > With this fix, I see 16 failures resolved in the following test cases: > > gdb.ada/task_switch_in_core.exp > gdb.arch/i386-tls-regs.exp > gdb.threads/threadcrash.exp > gdb.threads/tls-core.exp > > Looking at just one of these, from gdb.log without the fix, I see: > > thread apply 5 backtrace > > Thread 5 (LWP 3414829): > #0 0x00007ffff7d1d982 in __syscall_cancel_arch () from /lib64/libc.so.6 > #1 0x0000000000000000 in ?? () > (gdb) FAIL: gdb.threads/threadcrash.exp: test_gcore: thread apply 5 backtrace > > And this is what it looks like with the fix in place (some paths have > been shortened): > > thread apply 5 backtrace > > Thread 5 (Thread 0x7fffeffff6c0 (LWP 1282651) "threadcrash"): > #0 0x00007ffff7d1d982 in __syscall_cancel_arch () from /lib64/libc.so.6 > #1 0x00007ffff7d11c3c in __internal_syscall_cancel () from /lib64/libc.so.6 > #2 0x00007ffff7d61b62 in clock_nanosleep@GLIBC_2.2.5 () from /lib64/libc.so.6 > #3 0x00007ffff7d6db37 in nanosleep () from /lib64/libc.so.6 > #4 0x00007ffff7d8008e in sleep () from /lib64/libc.so.6 > #5 0x00000000004006a8 in do_syscall_task (location=NORMAL) at threadcrash.c:158 > #6 0x0000000000400885 in thread_function (arg=0x404340) at threadcrash.c:277 > #7 0x00007ffff7d15464 in start_thread () from /lib64/libc.so.6 > #8 0x00007ffff7d985ac in __clone3 () from /lib64/libc.so.6 > (gdb) PASS: gdb.threads/threadcrash.exp: test_live_inferior: thread apply 5 backtrace > > Regression testing on Fedora 42 (glibc 2.41) shows no new failures. > Hi Kevin, I'm seeing the same failures on openSUSE Tumbleweed, and I've done a full test run with the patch, and indeed it fixes all those failures and causes no regression. Thank you for fixing this, and the clear and detailed explanation. I've reviewed the patch, and it LGTM. [ FWIW, I'm wondering if we could use "target_auxv_search (AT_PAGESZ, &page_size)" as a way to improve on the default value, which itself looks good to me. Perhaps as a follow-up patch? ] Approved-By: Tom de Vries Thanks, - Tom > The v1 patch used SPARSE_BLOCK_SIZE as the fallback size. While it > was the correct size, it's used for an entirely different purpose > elsewhere in this file. This v2 commit introduces the constant > FALLBACK_PAGE_SIZE instead. > > References: > > [1] Linux commit 662df3e5c376 ("mm: madvise: implement lightweight > guard page mechanism") > https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=662df3e5c37666d6ed75c88098699e070a4b35b5 > [2] glibc commit a6fbe36b7f31 ("nptl: Add support for setup guard > pages with MADV_GUARD_INSTALL") > https://sourceware.org/git/?p=glibc.git;a=commit;h=a6fbe36b7f31292981422692236465ab56670ea9 > > Claude Opus 4.5 and GLM 4.7 assisted with the development of this commit. > > Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=33855 > --- > gdb/gcore.c | 52 ++++++++++++++++++++++++++++++++++++++++++++-------- > 1 file changed, 44 insertions(+), 8 deletions(-) > > diff --git a/gdb/gcore.c b/gdb/gcore.c > index 5a3ad145d4c..7bf2f00e866 100644 > --- a/gdb/gcore.c > +++ b/gdb/gcore.c > @@ -743,6 +743,12 @@ sparse_bfd_set_section_contents (bfd *obfd, asection *osec, > return true; > } > > +/* Fallback page size to use when target_read_memory fails when attempting > + to read MAX_COPY_BYTES in gcore_copy_callback. 4KB is the correct size > + to use for x86 and most other architectures. Some may have larger pages, > + but this size will still work at the cost of more syscalls. */ > +#define FALLBACK_PAGE_SIZE 0x1000 > + > static void > gcore_copy_callback (bfd *obfd, asection *osec) > { > @@ -765,15 +771,45 @@ gcore_copy_callback (bfd *obfd, asection *osec) > if (size > total_size) > size = total_size; > > - if (target_read_memory (bfd_section_vma (osec) + offset, > - memhunk.data (), size) != 0) > + CORE_ADDR vma = bfd_section_vma (osec) + offset; > + > + if (target_read_memory (vma, memhunk.data (), size) != 0) > { > - warning (_("Memory read failed for corefile " > - "section, %s bytes at %s."), > - plongest (size), > - paddress (current_inferior ()->arch (), > - bfd_section_vma (osec))); > - break; > + /* Large read failed. This can happen when the memory region > + contains unreadable pages (such as guard pages embedded within > + a larger mapping). Fall back to reading page by page, filling > + unreadable pages with zeros. */ > + gdb_byte *p = memhunk.data (); > + bfd_size_type remaining = size; > + CORE_ADDR addr = vma; > + bool at_least_one_page_read = false; > + > + while (remaining > 0) > + { > + bfd_size_type chunk_size > + = std::min (remaining, (bfd_size_type) FALLBACK_PAGE_SIZE); > + > + if (target_read_memory (addr, p, chunk_size) != 0) > + { > + /* Failed to read this page. Fill with zeros. This > + handles guard pages and other unreadable regions > + that may exist within a larger readable mapping. */ > + memset (p, 0, chunk_size); > + } > + else > + at_least_one_page_read = true; > + > + p += chunk_size; > + addr += chunk_size; > + remaining -= chunk_size; > + } > + /* Warn only if the entire region was unreadable - this > + indicates a real problem, not just embedded guard pages. */ > + if (!at_least_one_page_read) > + warning (_("Memory read failed for corefile " > + "section, %s bytes at %s."), > + plongest (size), > + paddress (current_inferior ()->arch (), vma)); > } > > if (!sparse_bfd_set_section_contents (obfd, osec, memhunk.data (),