From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 30973 invoked by alias); 2 Aug 2013 01:13:44 -0000 Mailing-List: contact gdb-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: gdb-owner@sourceware.org Received: (qmail 30957 invoked by uid 89); 2 Aug 2013 01:13:44 -0000 X-Spam-SWARE-Status: No, score=-0.2 required=5.0 tests=AWL,BAYES_40,KHOP_THREADED,RCVD_IN_BL_SPAMCOP_NET,RCVD_IN_DNSWL_NONE,RCVD_IN_HOSTKARMA_YE,RDNS_NONE,SPF_SOFTFAIL autolearn=no version=3.3.1 Received: from Unknown (HELO oproxy5.bluehost.com) (67.222.38.55) by sourceware.org (qpsmtpd/0.84/v0.84-167-ge50287c) with SMTP; Fri, 02 Aug 2013 01:13:42 +0000 Received: (qmail 16401 invoked by uid 0); 2 Aug 2013 01:13:35 -0000 Received: from unknown (HELO box531.bluehost.com) (74.220.219.131) by oproxy5.bluehost.com with SMTP; 2 Aug 2013 01:13:35 -0000 Received: from [146.115.71.23] (port=57730 helo=[172.31.1.206]) by box531.bluehost.com with esmtpsa (SSLv3:CAMELLIA256-SHA:256) (Exim 4.80) (envelope-from ) id 1V53vz-0006mv-3J; Thu, 01 Aug 2013 19:13:35 -0600 Message-ID: <1375406012.3028.38.camel@homebase> Subject: Re: Tools to classify / uniquify core dumps or stack traces? From: Paul Smith Reply-To: psmith@gnu.org To: Jan Kratochvil Cc: gdb@sourceware.org Date: Fri, 02 Aug 2013 01:13:00 -0000 In-Reply-To: <20130801193547.GA12116@host2.jankratochvil.net> References: <1375385181.3028.5.camel@homebase> <20130801193547.GA12116@host2.jankratochvil.net> Content-Type: text/plain; charset="UTF-8" Mime-Version: 1.0 Content-Transfer-Encoding: 7bit X-Identified-User: {678:box531.bluehost.com:madscie1:mad-scientist.us} {sentby:smtp auth 146.115.71.23 authed with paul@mad-scientist.us} X-SW-Source: 2013-08/txt/msg00003.txt.bz2 On Thu, 2013-08-01 at 21:35 +0200, Jan Kratochvil wrote: > On Thu, 01 Aug 2013 21:26:21 +0200, Paul Smith wrote: > > Hi all. I've got an environment where I'm getting lots of core dumps > > from various places and it's very tedious to go through them and > > determine which ones are for unique problems, and which are essentially > > duplicates (same bug causing the core dump). > > > > I was thinking of throwing together some kind of Perl or Python script > > that could compare and categorize stack traces, but I thought surely > > someone must have done something like this before. > > > > Anyone have any pointers or thoughts about something like this? > > ABRT, it has several backends how to report the results (the typical one is > Bugzilla), it also supports heuristic duplicates detection etc. > It is shipped in all recent Fedora releases by default and it is also > a project deployable on any OS: > https://fedorahosted.org/abrt/ > > There is also Apport (I do not have experience with it). I don't need to collect the cores (at least not yet), they are being obtained through other methods (which I don't have a lot of control over, currently) and I have bunches of them sitting in a directory. Along with them I have the binary they were generated from, a source tree it was built from, and a sysroot containing the runtime libraries it was invoked with. I can easily script up something to generate GDB stack traces from them. I just need something to reduce/classify them. It looks like btparser is close to what I need. It's an unusual choice, IMHO, to use C to implement something which so fundamentally depends on text manipulation; I would have chosen Perl or Python myself. But I'll definitely take a look. I notice that "satyr" is said to be "the next generation btparser", but from my very brief overview it appears to be more tightly bound up with abrt and the way that abrt packages things, and I'm not using abrt. Thanks for the pointer!