From: Joel Brobecker <brobecker@adacore.com>
To: gdb-patches@sourceware.org
Subject: [RFA] Fix crash on Linux 2.4 when threaded program exits
Date: Wed, 01 Apr 2009 18:22:00 -0000 [thread overview]
Message-ID: <20090401182220.GG16605@adacore.com> (raw)
[-- Attachment #1: Type: text/plain, Size: 2307 bytes --]
The debugger crashes when debugging a threaded program when the program
exits:
(gdb) run
Starting program: /[...]/q
[Thread debugging using libthread_db enabled]
[New Thread 0xb748ebb0 (LWP 9340)]
[New Thread 0xb728abb0 (LWP 9341)]
Test2
Test1
[Thread 0xb748ebb0 (LWP 9340) exited]
[Thread 0xb728abb0 (LWP 9341) exited]
[Thread 0xb75d9b80 (LWP 9337) exited]
Recursive internal problem.
zsh: 9330 abort gdb-head q
It appears that this is only specific to Linux kernels 2.4, and the way
the NPTL behaves on that version of the kernel: With 2.4, we only receive
an "exited" notification for the main thread, whereas with 2.6, we receive
the notification for each and every thread.
What happens in the 2.4 case is that we delete the lp structure for
the thread that exited and then still try to use it shortly after.
At this point, the memory has been free'ed and the contents has been
corrupted. As a result, we hit an internal error that hits another
internal error that causes the abort.
The code in linux-nat.c:linux_nat_filter_event looks like this:
if ((WIFEXITED (status) || WIFSIGNALED (status)) && num_lwps > 1)
{
[delete threads that have vanished]
exit_lwp (lp);
/* If there is at least one more LWP, then the exit signal was
not the end of the debugged application and should be
ignored. */
if (num_lwps > 0)
return NULL;
}
As you can see, in the linux-2.4 case, we end up deleting all threads,
then call exit_lwp to delete the main thread. Next we check num_lwps
which is zero, so we continue. Shortly after that, in the same routine,
we already access lp (around line 2717, "lp->ignore_sigint"), but the
symptoms actually appear slightly later when accessing the lp ptid
in order to set the inferior_ptid which is used to get the associated
inferior.
The fix was to delete the lp and return NULL iff there are other
lwps that still exist.
2009-04-01 Joel Brobecker <brobecker@adacore.com>
* linux-nat.c (linux_nat_filter_events): Do not delete the lwp if
this is the last one.
Tested on x86-linux (with a 2.4.21 Linux kernel). It fixes ~25 failures.
Tested on x86_64-linux (with a 2.6 kernel). No regression.
Does this look correct?
Thanks,
--
Joel
[-- Attachment #2: threads-24.diff --]
[-- Type: text/x-diff, Size: 782 bytes --]
diff --git a/gdb/linux-nat.c b/gdb/linux-nat.c
index be99ece..feca722 100644
--- a/gdb/linux-nat.c
+++ b/gdb/linux-nat.c
@@ -2644,13 +2644,14 @@ linux_nat_filter_event (int lwpid, int status, int options)
"LLW: %s exited.\n",
target_pid_to_str (lp->ptid));
- exit_lwp (lp);
-
- /* If there is at least one more LWP, then the exit signal was
- not the end of the debugged application and should be
- ignored. */
- if (num_lwps > 0)
- return NULL;
+ if (num_lwps > 1)
+ {
+ /* If there is at least one more LWP, then the exit signal
+ was not the end of the debugged application and should be
+ ignored. */
+ exit_lwp (lp);
+ return NULL;
+ }
}
/* Check if the current LWP has previously exited. In the nptl
next reply other threads:[~2009-04-01 18:22 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-04-01 18:22 Joel Brobecker [this message]
2009-04-01 18:42 ` Pedro Alves
2009-04-01 18:58 ` Joel Brobecker
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090401182220.GG16605@adacore.com \
--to=brobecker@adacore.com \
--cc=gdb-patches@sourceware.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox