From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 17484 invoked by alias); 9 Feb 2015 23:06:28 -0000 Mailing-List: contact gdb-patches-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: gdb-patches-owner@sourceware.org Received: (qmail 17456 invoked by uid 89); 9 Feb 2015 23:06:27 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-2.0 required=5.0 tests=AWL,BAYES_00,SPF_HELO_PASS,SPF_PASS,T_RP_MATCHES_RCVD autolearn=ham version=3.3.2 X-HELO: mx1.redhat.com Received: from mx1.redhat.com (HELO mx1.redhat.com) (209.132.183.28) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with (AES256-GCM-SHA384 encrypted) ESMTPS; Mon, 09 Feb 2015 23:06:26 +0000 Received: from int-mx11.intmail.prod.int.phx2.redhat.com (int-mx11.intmail.prod.int.phx2.redhat.com [10.5.11.24]) by mx1.redhat.com (8.14.4/8.14.4) with ESMTP id t19N6Oik005769 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Mon, 9 Feb 2015 18:06:24 -0500 Received: from [127.0.0.1] (ovpn01.gateway.prod.ext.ams2.redhat.com [10.39.146.11]) by int-mx11.intmail.prod.int.phx2.redhat.com (8.14.4/8.14.4) with ESMTP id t19N6MGp022779; Mon, 9 Feb 2015 18:06:23 -0500 Message-ID: <54D93D6E.30806@redhat.com> Date: Mon, 09 Feb 2015 23:06:00 -0000 From: Pedro Alves User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.3.0 MIME-Version: 1.0 To: Doug Evans CC: gdb-patches Subject: Re: [pushed] Improve gdb.threads/attach-many-short-lived-threads.exp timeout handling References: <1423225537-26694-1-git-send-email-palves@redhat.com> In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-SW-Source: 2015-02/txt/msg00201.txt.bz2 On 02/09/2015 09:55 PM, Doug Evans wrote: > On Fri, Feb 6, 2015 at 4:25 AM, Pedro Alves wrote: >> The buildbot shows that this test is still racy, and occasionally >> fails with time outs on some machines. I'd like to get major issues >> with load out of the way. > > Thanks for fixing this. > > btw, how often do you run the testsuite in parallel? > I see such load related issues all the time that way. Yeah, I run the testsuite in parallel all the time, with -j8, on an i7-2620M (2 cores / 4 threads). The attach-many-short-lived-threads.exp never fails for me though. I've now got access to Sergio's build slave, the one that generates the buildbot test results that show the racy failures, and when I run that test manually, in a loop, I never see it fail. OTOH, through build bot the test fails very often... This was last thing last Friday, and I haven't managed to get back to it yet. The test seems to expose more bugs, but the build bot also shows it FAILing in a very odd way sometimes, like failing to attach the very first time. > [e.g., checkpoint.exp, gdb-sigterm.exp, et.al.] Yeah, checkpoint.exp forks a ton of processes, and sometimes I'll hit the "ulimit -u (max user processes)" limit when running in parallel. I bump that "ulimit -u 10000" in the shell that I run tests on, and then it never fails. The gdb-sigterm.exp one should be fixed, I hope, since: https://sourceware.org/ml/gdb-patches/2015-02/msg00151.html Thanks, Pedro Alves