Mirror of the gdb mailing list
 help / color / mirror / Atom feed
* Re: Linux threads support in GDB
       [not found] <200009140024.e8E0OCK13917@mbox.wins.uva.nl>
@ 2000-10-02  5:57 ` Mark Kettenis
  2000-10-02 16:15   ` gdb
  2000-10-04 17:12   ` gdb
  0 siblings, 2 replies; 4+ messages in thread
From: Mark Kettenis @ 2000-10-02  5:57 UTC (permalink / raw)
  To: gdb; +Cc: gdb

   Date: Wed, 13 Sep 2000 18:25:21 -0600 (MDT)
   From: gdb@dontknow.org

Appologies for the this late reply.  Meanwhile, the particular failing
assertions you were seeing should be gone.

   Ok, here is a test case that will reproduce the problem (to some
   extent anyways). I guess for some of my testing trying to reproduce
   the problem with something similiar I was using GDB 5.0, not the
   cvs version. However that version also has problems with the
   following code.) Normally when I run the real application I have
   SIGSTOP and SIGCONT set to nostop, which may be why it isn't seen
   until a breakpoint is used, whereas here it happens after the
   implicit SIGCONT breakpoint.

Hmm.  GDB heavily relies on SIGSTOP itself.  There is no other way to
stop threads in Linux :-(.  Due to the nature of signals (particularly
the fact that multiple signals might be merged into one), this rather
hoplessly interferes with the SIGSTOPs you're using in your program.
I really don't intend to fix this.  Sorry.

Mark

PS Please keep CC'ing the GDB mailing list.  These discussions could
be useful for other people too.
From Peter.Schauer@regent.e-technik.tu-muenchen.de Mon Oct 02 10:04:00 2000
From: "Peter.Schauer" <Peter.Schauer@regent.e-technik.tu-muenchen.de>
To: gdb@sourceware.cygnus.com
Cc: ezannoni@cygnus.com
Subject: Problem with 2000-05-05 elfread.c change
Date: Mon, 02 Oct 2000 10:04:00 -0000
Message-id: <200010021704.TAA02459@reisser.regent.e-technik.tu-muenchen.de>
X-SW-Source: 2000-10/msg00002.html
Content-length: 2054

Problem with 2000-05-05 elfread.c change

This change:

2000-05-05  Elena Zannoni  <ezannoni@kwikemart.cygnus.com>

        * elfread.c (elf_symtab_read): The calculation of 'offset' 
        must be done for each symbol, not just once. The index 
        used must be the index of the section where 'sym' resides,
        not .text.

causes subtle and difficult to track down problems with some gcc compiled C++
executables.
These executables have a very large number of .gnu.linkonce sections
when using native ld on Solaris (I have not yet tried GNU ld, but GDB
should work with executables generated with native ld).
As an example, testsuite/gdb.c++/virtfunc contains 95 sections, our large
C++ application 2750 sections.

The large number of sections causes an indexing beyond the bounds of the
offsets array in the ANOFFSET macro, which does no bounds checking.
So intermittent garbage is obtained for the offset value, causing
symbols to obtain the wrong address.


I hope that you have access to a GNU Solaris toolchain using native ld.

You should then be able to verify the problem by applying the ANOFFSET
bounds checking kludge below (can't use SECT_OFF_MAX, because not all users
of ANOFFSET include gdb-stabs.h).

Run the resulting GDB on testsuite/gdb.c++/virtfunc and watch it die...

*** ./symtab.h.orig	Fri Sep 15 21:27:33 2000
--- ./symtab.h	Fri Sep 29 23:22:48 2000
***************
*** 831,837 ****
  
  #define	ANOFFSET(secoff, whichone) \
     ((whichone == -1) ? \
!     (internal_error ("Section index is uninitialized"), -1) : secoff->offsets[whichone])
  
  /* The maximum possible size of a section_offsets table.  */
  
--- 831,840 ----
  
  #define	ANOFFSET(secoff, whichone) \
     ((whichone == -1) ? \
!     (internal_error ("Section index is uninitialized"), -1) : \
!      ((whichone >= 40) ? \
!       (internal_error ("Section index out of bounds"), -1) : \
!        secoff->offsets[whichone]))
  
  /* The maximum possible size of a section_offsets table.  */

-- 
Peter Schauer			pes@regent.e-technik.tu-muenchen.de
From gdb@dontknow.org Mon Oct 02 16:00:00 2000
From: gdb@dontknow.org
To: Mark Kettenis <kettenis@wins.uva.nl>
Cc: gdb@sourceware.cygnus.com
Subject: Re: Linux threads support in GDB
Date: Mon, 02 Oct 2000 16:00:00 -0000
Message-id: <b919ec67e876ff043717c4ae09926084@NO-ID-FOUND.mhonarc.org>
References: <200010021257.e92CvRm16644@debye.wins.uva.nl>
X-SW-Source: 2000-10/msg00003.html
Content-length: 2178

On 02-Oct-00 Mark Kettenis wrote:
>    Date: Wed, 13 Sep 2000 18:25:21 -0600 (MDT)
>    From: gdb@dontknow.org
> 
> Appologies for the this late reply.  Meanwhile, the particular failing
> assertions you were seeing should be gone.
> 
>    Ok, here is a test case that will reproduce the problem (to some
>    extent anyways). I guess for some of my testing trying to reproduce
>    the problem with something similiar I was using GDB 5.0, not the
>    cvs version. However that version also has problems with the
>    following code.) Normally when I run the real application I have
>    SIGSTOP and SIGCONT set to nostop, which may be why it isn't seen
>    until a breakpoint is used, whereas here it happens after the
>    implicit SIGCONT breakpoint.
> 
> Hmm.  GDB heavily relies on SIGSTOP itself.  There is no other way to
> stop threads in Linux :-(.  Due to the nature of signals (particularly
> the fact that multiple signals might be merged into one), this rather
> hoplessly interferes with the SIGSTOPs you're using in your program.
> I really don't intend to fix this.  Sorry.

And my application requires the occassional use of SIGSTOP for stopping threads
(as you said, there is no other way to do it, at least not that I can find.)
However I am not exactly sure what the cause of the problem is. Especially
considering that a signal is sent to a single thread and not all threads (with
the possible exceptions of the main thread or signals from the shell). This
behavior is where Linux is different than standard PTHREADS (otherwise the use
of SIGSTOP would be a bad thing for my application.)

Another problem that I am seeing with the threading is that exiting threads
are ending up defunct rather than going away while using GDB (correct behavior
is seen w/o GDB). Is SIGCHLD is being handled correctly? IIRC that is the cause
of that.

> Mark
> 
> PS Please keep CC'ing the GDB mailing list.  These discussions could
> be useful for other people too.

Ok, I'll try to remember. I'm subscribed to the gdb-bug list but not the gdb
list.


Contrary to popular belief, UNIX is user friendly.
  It just happens to be selective about who it makes friends with.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Linux threads support in GDB
  2000-10-02  5:57 ` Linux threads support in GDB Mark Kettenis
@ 2000-10-02 16:15   ` gdb
  2000-10-04 17:12   ` gdb
  1 sibling, 0 replies; 4+ messages in thread
From: gdb @ 2000-10-02 16:15 UTC (permalink / raw)
  To: Mark Kettenis; +Cc: gdb

On 02-Oct-00 Mark Kettenis wrote:
>    Date: Wed, 13 Sep 2000 18:25:21 -0600 (MDT)
>    From: gdb@dontknow.org
> 
> Appologies for the this late reply.  Meanwhile, the particular failing
> assertions you were seeing should be gone.

I actually tested it, and it gets rid of the assertion, though the application
still exits when the SIGALRM happens while the processes is STOP'ed. Is there
any way to queue up the signals and send them upon program/thread resumption?

I did however realize that part of what I was missing about the problem is
that all threads are stopped during program breakpoints. So I guess that I can
kinda see how signals happening while the program is stopped. Can the itimer be
suspended somehow when trying to stop all threads? (Yes I know, timers are only
one possible cause, and finding all causes would be impossible since user
applications could cause the problem, however itimer might be the most common.
And something special must be happening for SIGUSR1 which pthread uses, but I
haven't looked at too many of the GDB threading internals.)

>    Ok, here is a test case that will reproduce the problem (to some
>    extent anyways). I guess for some of my testing trying to reproduce
>    the problem with something similiar I was using GDB 5.0, not the
>    cvs version. However that version also has problems with the
>    following code.) Normally when I run the real application I have
>    SIGSTOP and SIGCONT set to nostop, which may be why it isn't seen
>    until a breakpoint is used, whereas here it happens after the
>    implicit SIGCONT breakpoint.
> 
> Hmm.  GDB heavily relies on SIGSTOP itself.  There is no other way to
> stop threads in Linux :-(.  Due to the nature of signals (particularly
> the fact that multiple signals might be merged into one), this rather
> hoplessly interferes with the SIGSTOPs you're using in your program.
> I really don't intend to fix this.  Sorry.
> 
> Mark
> 
> PS Please keep CC'ing the GDB mailing list.  These discussions could
> be useful for other people too.



Contrary to popular belief, UNIX is user friendly.
  It just happens to be selective about who it makes friends with.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Linux threads support in GDB
  2000-10-02  5:57 ` Linux threads support in GDB Mark Kettenis
  2000-10-02 16:15   ` gdb
@ 2000-10-04 17:12   ` gdb
  1 sibling, 0 replies; 4+ messages in thread
From: gdb @ 2000-10-04 17:12 UTC (permalink / raw)
  To: Mark Kettenis; +Cc: gdb

> Explicitly stopping individual threads is considered bad practice by
> most threads experts.  And of course using SIGSTOP is unportable.
> That's why I don't consider this a critical problem.

I can believe that it is considered bad, and right now the only implementation
of thread stopping is used when creating a new thread in certain instances
(Some applications expect threads to be created in a stopped state and use
SIGSTOP to do so. I could implement support for this differently, waiting on a
condition until the resume is given, and if there is a stop/resume pair used
outside of this case then it would break GDB, but at this time it isn't used.)

One application that uses SIGSTOP for suspending a running thread is Sun's J2SE.

> I'd be delighted if you can find a way to make things work.  I just
> don't see how.

If I get enough time to do so, I will think on the problem some more (so far
I've only done brief poking around.)

--Randy

Contrary to popular belief, UNIX is user friendly.
  It just happens to be selective about who it makes friends with.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Linux threads support in GDB
       [not found] <200010050012.CAA26158@mailhost.ri.silicomp.fr>
@ 2000-10-04 22:50 ` Eric Paire
  0 siblings, 0 replies; 4+ messages in thread
From: Eric Paire @ 2000-10-04 22:50 UTC (permalink / raw)
  To: gdb; +Cc: Mark Kettenis, gdb

> One application that uses SIGSTOP for suspending a running thread is Sun's J2SE.
> 
Two years ago, we ported the Sun JDK-1.1.6 on native LinuxThreads and
did *NOT* use SIGSTOP(because, if I remember correctly, this schema
was not sufficient to insure correct thread synchronization on
multiprocessors); at the same time, I implemented the very first
version of the LinuxThread support within GDB-4.17 to debug it,
and it worked !

-Eric
+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+ Eric PAIRE
Web  : http://www.ri.silicomp.com/~paire  | Groupe SILICOMP - Research Institute
Email: eric.paire@ri.silicomp.com         | 2, avenue de Vignate
Phone: +33 (0) 476 63 48 71               | F-38610 Gieres
Fax  : +33 (0) 476 51 05 32               | FRANCE


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2000-10-04 22:50 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <200009140024.e8E0OCK13917@mbox.wins.uva.nl>
2000-10-02  5:57 ` Linux threads support in GDB Mark Kettenis
2000-10-02 16:15   ` gdb
2000-10-04 17:12   ` gdb
     [not found] <200010050012.CAA26158@mailhost.ri.silicomp.fr>
2000-10-04 22:50 ` Eric Paire

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox