Date: Tue, 05 Oct 2004 13:48:00 -0000
From: Daniel Jacobowitz
To: Jim Blandy
Cc: gdb-patches@sources.redhat.com
Subject: Re: [rfa/dwarf] Support for attributes pointing to a different CU
Message-ID: <20041005134808.GA11252@nevyn.them.org>
References: <20040923045723.GA11871@nevyn.them.org> <20040924003412.GB10500@nevyn.them.org> <20041003161221.GA3234@nevyn.them.org> <20041004212201.GA21064@nevyn.them.org>

On Tue, Oct 05, 2004 at 12:04:26AM -0500, Jim Blandy wrote:
> The only advantage I had in mind was simplicity, and it didn't seem
> like it'd be a performance hit.
>
> The libiberty hash table expands by a given ratio each time, which
> means that, overall, the number of rehashings per element is constant
> no matter how large the table gets.  It's similar to the analysis that
> shows that ye olde buffer doubling trick ends up being linear time.
> (I'm thinking on my own here, not quoting anyone, so be critical...)
>
> There could be a locality disadvantage to doing it all in one big hash
> table.
> When the time comes to restore a given CU's types, their table
> entries will be sharing cache blocks with those of other, irrelevant
> CU's.  That doesn't happen if we use per-CU hash tables: table
> entries will never share cache blocks with table entries for other
> CU's (assuming the tail of one table doesn't share a block with the
> head of another, blah blah blah...).
>
> I'm concerned about the legacy of complexity we'll leave.  Wrinkles
> should prove they can pay their own way.  :)

Then there's only one thing to do... I'll time it.

Using a per-objfile type_hash saves a little memory in overhead, and
probably a little more in hash table size - I didn't instrument memory
use.  But it's definitely slower: from 1% to 4.3% depending on the test
case.

I believe this happens because we can create the per-comp-unit hash
tables at the correct size - I use a heuristic based on the size of the
CU, although by this point I could use a more accurate one based on the
number of DIEs if I thought it would be worthwhile.  If we create a
single per-objfile hash table, we don't get this benefit, so we do a
lot of copying during rehashes.  There's also the locality benefit.

Also, the per-objfile table saves no code - unless you see something
I'm missing, it was basically
s/cu->per_cu->type_hash/dwarf2_per_objfile->type_hash/.

So, OK with the per-comp-unit hash?

-- 
Daniel Jacobowitz