Date: Wed, 30 Mar 2005 20:10:00 -0000
From: "Eli Zaretskii"
To: GDB
Subject: Re: [mi] watchpoint-scope exec async command
In-reply-to: <20050329214414.GA3498@nevyn.them.org> (message from Daniel Jacobowitz on Tue, 29 Mar 2005 16:44:14 -0500)

> Date: Tue, 29 Mar 2005 16:44:14 -0500
> From: Daniel Jacobowitz
> Cc: GDB
>
> If the scope breakpoint triggers, we delete it.  From watch_command_1:
>
>   /* Automatically delete the breakpoint when it hits.  */
>   scope_breakpoint->disposition = disp_del;
>
> That's what's happening in this case.  Then, shortly thereafter, the
> watchpoint triggers.
> That's when we detect that it has gone out of scope, and set it to
> delete at next stop; and we crash, because we already deleted the
> scope breakpoint when it was hit.

I hoped to see this in Bob's tracebacks, but I only saw the first part
of what you describe: that the scope breakpoint is deleted after it
triggers (not _when_, _after_: it is deleted by breakpoint_auto_delete).
Assuming that the watchpoint triggers after that, it is marked
disp_del_at_next_stop, so it would be slated for deletion by the same
breakpoint_auto_delete function when it is called shortly afterwards.
This is the part I didn't see in Bob's session.  I will assume that
things indeed happen as you say: when we try to delete that watchpoint,
we crash accessing its scope breakpoint, which was already deleted and
freed.

I think we have the following alternatives for fixing this.

First, we could stop using scope breakpoints for hardware-assisted
watchpoints.  The scope breakpoint is not needed in that case: hardware
watchpoints don't slow down the executable, and we have an independent
facility for detecting that a hardware watchpoint went out of scope,
namely the code run by insert_bp_location and watchpoint_check, which
prints a warning that the watchpoint went out of scope.  Software
watchpoints do need the scope breakpoint (to stop single-stepping the
inferior once the watchpoint goes out of scope), and for them Bob's
testing demonstrates that the scope-breakpoint machinery works
correctly.  So we would continue using scope breakpoints for software
watchpoints alone.  If we never arrange a scope breakpoint for a
hardware watchpoint, we cannot hit the problem Bob reported.

The second alternative is to treat scope breakpoints specially in
breakpoint_auto_delete: when we see a scope breakpoint marked for
deletion, find its watchpoint and, if that watchpoint is a hardware
watchpoint, delete it as well.
I like the first alternative better, since it seems cleaner.

As an aside, I'd ask Bob to run the same test program, but this time
with the awatch command instead of watch.  I'd be interested to hear
whether the same problems (i.e. the write into a freed block reported
by valgrind and the occasional crash) happen in that case as well.  The
reason I'm asking is that we handle watch and rwatch/awatch slightly
differently, since the code that handles watch runs for both software
and hardware watchpoints.

> All the code surrounding bp_none is horribly bogus; we're walking freed
> memory if we hit it.

That's true; it sounds as if someone wasn't sure what our code does, so
they went over-defensive.

> I don't know if it still triggers today

I think Bob's testing shows that it does, for software watchpoints.

Did I help resolve this issue?