ftmemsim-valgrind

mirror of https://github.com/Zenithsiz/ftmemsim-valgrind.git synced 2026-02-10 05:37:06 +00:00

Author	SHA1	Message	Date
Philippe Waroquiers	0181f813d2	This patch implements reading the directory information for source files in the dwarf3 reader. Basically, the change consists in replacing in the DiInlLoc struct const HChar* filename; /* caller source filename / by UInt fndn_ix; / index in di->fndnpool of caller source dirname/filename / A similar change is done in DiVariable struct, as the read_filename_Table code is shared between the inline info reader and the varinfo reader. Note however that outputting dirname in variable description is not done. Unclear if that is desired or not. It should be trivially doable however. Replacing filename by fndn_ix implies a bunch of semi-mechanical changes. The code to read the directory names is in the new function static XArray read_dirname_xa (struct _DebugInfo* di, const HChar compdir, Cursor c, Bool td3 ) Note that readdwarf.c and readdwarf3.c have significant duplicated logic. Would be nice to integrate these 2 dwarf readers in one single reader. This function is directly inspired from an equivalent piece of code in readdwarf.c. Modified memcheck/tests/varinfo5.vgtest to test the dirname appears in the inlined functions. Impact on memory is neglectable (a few Kb on a big executable). git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14245	2014-08-08 22:11:41 +00:00
Carl Love	98908947c7	This commit is for Bugzilla 334834. The Bugzilla contains patch 2 of 3 to add PPC64 LE support. The other two patches can be found in Bugzillas 334384 and 334836. POWER PC, add the functional Little Endian support, patch 2 The IBM POWER processor now supports both Big Endian and Little Endian. The ABI for Little Endian also changes. Specifically, the function descriptor is not used, the stack size changed, accessing the TOC changed. Functions now have a local and a global entry point. Register r2 contains the TOC for local calls and register r12 contains the TOC for global calls. This patch makes the functional changes to the Valgrind tool. The patch makes the changes needed for the none/tests/ppc32 and none/tests/ppc64 Makefile.am. A number of the ppc specific tests have Endian dependencies that are not fixed in this patch. They are fixed in the next patch. Per Julian's comments renamed coregrind/m_dispatch/dispatch-ppc64-linux.S to coregrind/m_dispatch/dispatch-ppc64be-linux.S Created new file for LE coregrind/m_dispatch/dispatch-ppc64le-linux.S. The same was done for coregrind/m_syswrap/syscall-ppc-linux.S. Signed-off-by: Carl Love <carll@us.ibm.com> git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14239	2014-08-07 23:35:54 +00:00
Carl Love	914f75de32	This commit is for Bugzilla 334384. The Bugzilla contains patch 1 of 3 to add PPC64 LE support. The other two patches can be found in Bugzillas 334834 and 334836. The commit does not have a VEX commit associated with it. POWER PC, add initial Little Endian support The IBM POWER processor now supports both Big Endian and Little Endian. This patch renames the #defines with the name ppc64 to ppc64be for the BE specific code. This patch adds the Little Endian #define ppc64le to the Additionally, a few functions are renamed to remove BE from the name if the function is used by BE and LE. Functions that are BE specific have BE put in the name. The goals of this patch is to make sure #defines, function names and variables consistently use PPC64/ppc64 if it refers to BE and LE, PPC64BE/ppc64be if it is specific to BE, PPC64LE/ppc64le if it is LE specific. The patch does not break the code for PPC64 Big Endian. The test files memcheck/tests/atomic_incs.c, tests/power_insn_available.c and tests/power_insn_available.c are also updated to the new #define definition for PPC64 BE. Signed-off-by: Carl Love <carll@us.ibm.com> git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14238	2014-08-07 23:17:29 +00:00
Philippe Waroquiers	24e0fbf92a	fix 338024 inlined functions are not shown if DW_AT_ranges is used Based on investigation and patch by Matthias Schwarzott. (no small test found that reproduced the problem, but the equivalent patch given in bug 338024 fixed the inlined stack trace in a big shared lib). Would be nice however to have a small test case ... git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14236	2014-08-05 19:34:35 +00:00
Philippe Waroquiers	0cc60a627e	cfsi_m_ix array should only be indexed according to sizeof_m_ix, so decalre as a void*. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14218	2014-07-31 16:44:51 +00:00
Julian Seward	fdfada9f35	Add support for stack unwinding using the ARM32 specific EXIDX format. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14217	2014-07-31 14:25:29 +00:00
Florian Krohm	90e7e07d7e	Back out r14186 as it was identified to have caused a performance regression. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14201	2014-07-29 08:46:15 +00:00
Petar Jovanovic	2c6514c42f	fix comment when calling get_sym_name() Fix incorrect comment, spotted by Florian K. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14199	2014-07-28 15:52:04 +00:00
Florian Krohm	e15ffd7c60	No need to write the offset into a buffer when that buffer is not used. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14198	2014-07-27 14:46:52 +00:00
Florian Krohm	cc7bc87712	Fix a comment. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14195	2014-07-26 14:51:28 +00:00
Florian Krohm	e402f69a09	Change VG_(strncpy_safely) to use VG_(strncpy) to get the same padding behaviour. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14186	2014-07-24 12:50:03 +00:00
Philippe Waroquiers	cb1d628c6a	optimise readpdb.c filename and dirname handling, following r14158 r14158 introduced a dedup pool to store pairs (filename,dirname). The windows debug info reader (readpdb.c) performance was still to be improved, as calls to ML_(addFnDn) were done for each line loc to add. With this patch, the nr of calls to ML_(addFnDn) should be reduced significantly. Code has been compiled and regtested on linux, but no windows/wine test was done. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14183	2014-07-23 20:28:11 +00:00
Philippe Waroquiers	a6ffc04b1c	produce cfsi and str dedup pa at the same verbosity level git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14166	2014-07-15 19:46:15 +00:00
Mark Wielaard	91c93d3896	Bug 336619 valgrind --read-var-info=yes doesn't handle DW_TAG_restrict_type. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14165	2014-07-15 15:47:25 +00:00
Julian Seward	b8805a006b	arm64: get_Dwarf_Reg: at least handle the case of requesting XSP instead of failing. This makes some of the memcheck/tests/varinfo* tests work somewhat correctly on arm64-linux. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14164	2014-07-15 15:22:41 +00:00
Philippe Waroquiers	c99e3af927	This patch decreases significantly the memory needed to store the lineloc info. On a big executable, the trunk needs: dinfo: 134873088/71438336 max/curr mmap'd, 134607808/66717872 max/curr With the patch, we have: dinfo: 99065856/56836096 max/curr mmap'd, 97883776/51663656 max/curr So, peak dinfo memory decreases by about 36Mb, and final by 15Mb. (for info, valgrind 3.9.0 uses dinfo: 158941184/109666304 max/curr mmap'd, 156775944/107590656 max/curr So, compared to 3.9.0, dinfo peak decreases by about 40%, and the final memory is divided by more than 2). The memory decrease is obtained by: * using a dedup pool to store filename/dirname pair for the loctab source/line information. As typically, there is not a lot of such pairs, typically a UShort is good enough to identify a fn/dn pair in a dedup pool. To avoid losing memory due to alignment, the fndn indexes are stored in a "parallel" array to the DiLoc loctab array, with entries having 1, or 2 or 4 bytes according to the nr of fn/dn pairs in the dedup pool. See priv_storage.h comments for details. (there was a extensible WordArray local implementation in readdwarf.c. As with this change, we use an xarray, the local implementation was removed). * the memory needed for --read-inline-info is slightly decreased (-2Mb) by removing the (unused) dirname from the DiInlLoc struct. Handling dirname for inlined function caller implies to rework the dwarf3 parser read_filename_table common to the var and inlinfo parser. Waiting for this to be done, the dirname component is removed from DiInlLoc. * the stabs reader (readstabs.c) is broken since 3.9.0. For this change, the code has been updated to make it compile with the new DiLoc/FnDn dedup pool. As the code is completely broken, a vg_assert(0) has been put at the begin of the stabs reader. * the pdb reader (readpdb.c) has been trivially updated and should still work. It has not been tested (how do we test this ?). A follow-up patch will be done to avoid doing too many calls to ML_(addFnDn) : instead of having one call per ML_(addLineInfo), one should have a single call done when reading the filename table. This has also be tested in an outer/inner setup, to verify no memory leak/bugs. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14158	2014-07-14 21:20:57 +00:00
Philippe Waroquiers	076f1c0157	Apply text_debug_bias to inline IP extracted from dwarf3 Without this biasing, inline info is not correct for shared objects. Updated test varinfo5 to use --read-inline-info=yes and added an inline test case. Note: the varinfo reader does not understand the inlining info, and so variables in inlined functions are not properly described. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14146	2014-07-08 18:56:47 +00:00
Julian Seward	1ffd6d9b6e	OSX 10.9/10.8: Debuginfo reading FSM: enable recording of r-- mappings so as to enable arrival at acceptance states via calls to VG_(di_notify_vm_protect). git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14139	2014-07-08 07:55:44 +00:00
Julian Seward	ff80e8f74f	Improve debug printing for the should-we-load-debuginfo-now? finite state machine. No functional change. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14138	2014-07-08 07:50:19 +00:00
Philippe Waroquiers	4dac969352	Mark inline get function in image.c (called very often, and has a fast/slow case) This slightly improve the performance of reading the image. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14135	2014-07-06 18:35:18 +00:00
Philippe Waroquiers	2c502a3da6	Replace copy/pasted loop of the "range search" by doing a -1 in the loop for the "equal" case. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14133	2014-07-05 18:37:38 +00:00
Philippe Waroquiers	e37b530896	Small fixes/improvements post-cfsi_m improvement * Avoid printing the size of a null dedup pool * Avoid warnings of 2 unused variables on some platforms git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14132	2014-07-05 14:07:43 +00:00
Philippe Waroquiers	4a3b52c13c	Small comment fix for the UInt* cfsi_m index : 4 instead of 3 git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14130	2014-07-04 22:52:01 +00:00
Philippe Waroquiers	09073639b5	This patch decreases significantly the memory needed to store the cfsi info. On a big executable, the trunk needs: dinfo: 155844608/106737664 max/curr mmap'd 155572624/102276760 max/curr With the patch, we have: dinfo: 134873088/70389760 max/curr mmap'd 134607808/66717512 max/curr So, peak dinfo memory decreases by 21Mb, and final by 36Mb. The memory decrease is obtained by: * using a dedup pool to store the machine dependent part (cfsi_m) of the cfsi information as this information is highly duplicated. For x86 and arm64, the duplication factor of cfsi machine dependent part is very high (up to a factor 60). For arm64, it is more like a factor 3. A 'variable size' (1, 2 or 4 bytes) is automatically used to identify the cfsi_m, if there is less than or more than 255/64K different cfsi_m. * not storing explicitely the length of a range for which a cfsi_m is to be used: in a large majority of the cases, ranges are consecutive, and so the end of a range is just one byte before the start of the next range. So, we do not store the length of the ranges. If there is a hole between 2 ranges, the hole is stored explicitely as a range in which we have no cfsi_m information. On x86 and amd64, we have quite some holes (something like one hole every 7 cfsi). On arm64, we have very few holes (less than one hole every 50 cfsi). Even with the nr of holes on x86/amd64, it is more memory efficient to store the holes rather than to store the length of each cfsi. * Merging consecutive ranges that have the same cfsi_m info: Many cfsi are "mergeable": there is no hole between 2 cfsi, and their machine dependent part is identical (I guess the unwind info needed by valgrind is subset of the full unwind info, and so, the cfsi entries are not merged by the compiler, but can be merged for simple unwind). Depending on the platform (x86, amd64, arm64) and of the library/object file, we can have a significant nr of mergeable entries. The patch is not very small, but a lot is mechanical changes. The patch has been compiled and tested on x86/amd64/ppc32/ppc64 (but ppc does not use cfsi so that just verifies it compiles). It has been compiled on arm64, and "tested" by launching valgrind on one executable. It has not been compiled on s390 and mips. With some luck, maybe it will compile on these platforms. And if that uses the whole provision of luck for 2014, it might even work on these platforms :). If it does not compile, the fix should be straightforward. Runtime problems might be more tricky (but arm64 "worked out of the box" once x86/amd64 were ok). This has also be tested in an outer/inner setup, to verify no memory leak/bugs. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14129	2014-07-04 22:36:38 +00:00
Philippe Waroquiers	ae7b27f706	Implement VG_(arena_realloc_shrink) similar to realloc, but can only decrease the size of a block, does not change the address, does not need to alloc another block and copy the memory, and (if big enough) makes the excess memory available for other allocations. VG_(arena_realloc_shrink) is then used for debuginfo storage.c (replacing an allocation + copy). Also use it in the dedup pool, to recuperate the unused memory of the last pool. This also allows to re-increase the string pool size to the original 3.9.0 value of 64Kb. All this slightly decrease the peak and in use memory of dinfo. VG_(arena_realloc_shrink) will also be used to implement (in another patch) a dedup pool which "numbers" the allocated elements. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14122	2014-06-30 19:47:24 +00:00
Philippe Waroquiers	a2ea737046	Find the name of the inlined function through a DW_AT_specification The name is not necessarily found in the abstract origin, it can be in a referred to specification. If both a name and a DW_AT_specification is found in the abstract origin, the name will have priority over the name of the specification. (unclear if that can happen) git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14076	2014-06-21 12:41:48 +00:00
Philippe Waroquiers	cdfd3be6b7	This optimisation divides by 2.5 the time (user+sys) needed to read the inlined info of a big executable. On a slow pentium, reading the inline info now takes 5.5 seconds. The optimisation consists in having per dw3 abbreviation a structure allowing to skip efficiently the non interesting DIEs (i.e. the DIEs the parse_inl_DIE is not interested in). Mostly, the idea is to avoid calling the image abstraction, and replace this by just advancing the cursor (i.e. addition rather than a bunch of function calls to read the data). git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14075	2014-06-21 10:57:33 +00:00
Philippe Waroquiers	083986d244	Use macro TD3 defined as UNLIKELY(td3) for tracing to be sure the compiler understands that usually, we do not trace git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14074	2014-06-21 09:48:17 +00:00
Julian Seward	91350dc8a5	Add initial build support for Mac OS X 10.9 (Mavericks). Bug 326724 comment 12. (Diego Giagio, diego@giagio.com) git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14055	2014-06-20 11:48:38 +00:00
Philippe Waroquiers	c919ff8a7a	restructure dwarf3 DIE tracing * add a trace_DIE function * use it to trace a bad DIE and to trace all DIEs that are (maybe) read (due to the "avoid read twice" optimisation, the tracing was not so easy to read anymore => add an explicit trace_DIE call at the beginning of read_DIE) git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14050	2014-06-17 20:21:26 +00:00
Philippe Waroquiers	2ee4ccfb2a	optimisation : avoid double reading of a DIE when the DIE will be parsed by a DIE parser Instead of pre-reading the DIE, first let the parser(s) possibly parse the DIE. Read (to skip) the DIE data if no parser has parsed it. OTherwise, just jump to the end of the DIE as established by the parser that has read the DIE. This slightly improves the reading of inlined info. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14049	2014-06-16 21:49:42 +00:00
Philippe Waroquiers	e6201c3eb1	Use a string literal format to avoid a gcc warning (-Wformat-security) git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14047	2014-06-16 21:25:31 +00:00
Philippe Waroquiers	6ef6931a84	Fix random crash due to non-init inlparser when --read-var-info given but not --read-inline-info Wrong place for the assertion for the inlparser + move the "zero the parsers" out of the "if VG_(clo*)" conditions git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14044	2014-06-16 18:08:02 +00:00
Philippe Waroquiers	707197d56b	Add a comment to document a possible optimisation (avoid double reading of DIEs when one or more parsers will read them also) + add the name of the parser in the barf output. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14041	2014-06-15 21:49:13 +00:00
Philippe Waroquiers	01bcadac8f	When only reading inline info, no need to parse debug_types sections git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14040	2014-06-15 19:16:46 +00:00
Philippe Waroquiers	efbeef9e71	Fix some obsolete comments, now that we have an ht of parsed abbvs git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14039	2014-06-15 18:28:31 +00:00
Philippe Waroquiers	af510aa4c3	separate the tracing code in other function, call the tracing code only if trace active. This makes the code somewhat easier to read and somewhat more efficient git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14038	2014-06-15 18:06:20 +00:00
Philippe Waroquiers	ceaa5b2efe	This patch implements the support needed for stacktraces showing inlined function calls. See 278972 valgrind stacktraces and suppression do not handle inlined function call debuginfo Reading the inlined dwarf call info is activated using the new clo --read-inline-info=yes Default is currently no but an objective is to optimise the performance and memory in order to possibly set it on by default. (see below discussion about performances). Basically, the patch provides the following pieces: 1. Implement a new dwarf3 reader that reads the inlined call info 2. Some performance improvements done for this new parser, and on some common code between the new parser and the var info parser. 3. Use the parsed inlined info to produce stacktrace showing inlined calls 4. Use the parsed inlined info in the suppression matching and suppression generation 5. and of course, some reg tests 1. new dwarf3 reader: --------------------- Two options were possible: add the reading of the inlined info in the current var info dwarf reader, or add a 2nd reader. The 2nd approach was preferred, for the following reasons: The var info reader is slow, memory hungry and quite complex. Having a separate parsing phase for the inlined information is simpler/faster when just reading the inlined info. Possibly, a single parser would be faster when using both --read-var-info=yes and --read-inline-info=yes. However, var-info being extremely memory/cpu hungry, it is unlikely to be used often, and having a separate parsing for inlined info does in any case make not much difference. (--read-var-info=yes is also now less interesting thanks to commit r13991, which provides a fast and low memory "reasonable" location for an address). The inlined info parser reads the dwarf info to make calls to priv_storage.h ML_(addInlInfo). 2. performance optimisations ---------------------------- * the abbrev cache has been improved in revision r14035. * The new parser skips the non interesting DIEs (the var-info parser has no logic to skip uninteresting DIEs). * Some other minor perf optimisation here and there. In total now, on a big executable, 15 seconds CPU are needed to create the inlined info (on my slow x86 pentium). With regards to memory, the dinfo arena: with inlined info: 172281856/121085952 max/curr mmap'd without : 157892608/106721280 max/curr mmap'd, So, basically, inlined information costs about 15Mb of memory for my big executable (compared to first version of the patch, this is already using less memory, thanks to the strpool deduppoolalloc. The needed memory can probably be decreased somewhat more. 3. produce better stack traces ------------------------------ VG_(describe_IP) has a new argument InlIPCursor iipc which allows to describe inlined function calls by doing repetitive calls to describe_IP. See pub_tool_debuginfo.h for a description. 4. suppression generation and matching -------------------------------------- suppression generation now also uses an InlIPCursor iipc to generate a line for each inlined fn call. suppression matching: to allow suppression matching to match one IP to several function calls in a suppression entry, the 'inputCompleter' object (that allows to lazily generate function or object names for a stacktrace when matching an error with a suppression) has been generalised a little bit more to also lazily generate the input sequence. VG_(generic_match) has been updated so as to be more generic with respect to the input completer : when providing an input completer, VG_(generic_match) does not need anymore to produce/compute any input itself : this is all delegated to the input completer. 5. various regtests ------------------- to test stack traces with inlined calls, and suppressions of (some of) these errors using inlined fn calls matching. Work still to do: ----------------- * improve parsing performance * improve the memory overhead. * handling the directory name for files of the inlined function calls is not yet done. (probably implies to refactor some code) * see if m_errormgr.c *offsets arrays cannot be managed via xarray git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14036	2014-06-15 15:42:20 +00:00
Philippe Waroquiers	19a3689518	Improve performance of dwarf3 reader using a hashtable of parsed abbreviations For each DIE, the dwarf3 reader must know which data elements to read. These elements are described by an abbreviation. Re-reading these abbreviations for each DIE is costly as the location of the needed abbreviation is found by scanning the full abbv section, which is very costly. (A small cache of 32 abbv offsets in the abbv section somewhat decreases the cost, but reading the abbvs is still a hot spot, in particular for big debug informations). This patch: * adds an hash table of parsed abbreviations * all abbreviations for a CU are read in one single scan of the abbv section, when the CU header is read So, with the patch, the di image is not accessed anymore for reading the abbvs after the CU header parsing. On a big executable, --read-var-info=yes user cpu changes from trunk: 320 seconds to abbv cache: 270 seconds This further improves on a previous (not committed) abbv cache that was just caching up to 513 entries in the abbv pos cache and populating the cache with an initial scan. The user cpu for this version was 285 seconds. NB: this is some work in anticipation of a following patch that will add reading dwarf3 inlined information, with the hope to make this reading fast enough to activate it by default. Note: on the examples I looked at, all abbreviations were numbered starting from 1, with no holes. If that would always be the case, then one could use an xarray of parsed abbreviations rather than an hash table. However, I found nothing in the dwarf standard that guarantees that abbreviations are numbered from 1. So, the hash table. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14035	2014-06-15 10:51:14 +00:00
Philippe Waroquiers	266ab63f8b	Do not destroy the strpool if NULL It is possible that a debug info contains no string (and so strpool is never allocated). A protection to avoid accessing strpool was already necessary in ML_(canonicaliseTables) : if (di->strpool) VG_(freezeDedupPA) (di->strpool); So, if a similar debug info is released, we need the same protection to avoid accessing a NULL strpool. Detect by Julian on arm64, but not (at least easily) reproduced on amd64. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14033	2014-06-14 19:09:22 +00:00
Philippe Waroquiers	53df23f0a6	This patch adds a 'de-duplicating memory pool allocator': include/pub_tool_deduppoolalloc.h coregrind/pub_core_deduppoolalloc.h coregrind/m_deduppoolalloc.c and uses it (currently only) for the strings in m_debuginfo/storage.c The idea is that such ddup pool allocator will also be used for other highly duplicated information (e.g. the DiCFSI information), where significant gains can also be achieved. The dedup pool for strings also decreases significantly the memory needed by the read inline information (patch still to be committed, see bug 278972). When testing with a big executable (tacot_process), this reduces the size of the dinfo arena from trunk: 158941184/109760512 max/curr mmap'd, 156775944/107882728 max/curr, to ddup: 157892608/106614784 max/curr mmap'd, 156362160/101414712 max/curr (so 3Mb less mmap-ed once debug info is read, 1Mb less mmap-ed in peak, 6Mb less allocated once debug info is read). This is all gained due to the string which changes from: trunk: 17,434,704 in 266: di.storage.addStr.1 to ddup: 10,966,608 in 750: di.storage.addStr.1 (6.5Mb less memory used by strings) The gain in mmap-ed memory is smaller due to fragmentation. Probably one could decrease the fragmentation by using bigger size for the dedup pool, but then we would lose memory on the last allocated pool (and for small libraries, we often do not use much of a big pool block). Solution might be to increase the pool size but have a "shrink_block" operation. To be looked at in the future. In terms of performance, startup of a big executable (on an old pentium) is not influenced significantly (something like 0.1 seconds on 15 seconds startup for a big executable, on a slow pentium). The dedup pool uses a hash table. The hash function used currently is the VG_(adler32) check sum. It is reported (and visible also here) that this checksum is not a very good hash function (many collisions). To have statistics about collisions, use --stats -v -v -v As an example of the collisions, on the strings in debug info of memcheck tool on x86, one obtain: --4789-- dedupPA:di.storage.addStr.1 9983 allocs (8174 uniq) 11 pools (4820 bytes free in last pool) --4789-- nr occurences of chains of len N, N-plicated keys, N-plicated elts --4789-- N: 0 : nr chain 6975, nr keys 0, nr elts 0 --4789-- N: 1 : nr chain 3670, nr keys 6410, nr elts 8174 --4789-- N: 2 : nr chain 1070, nr keys 226, nr elts 0 --4789-- N: 3 : nr chain 304, nr keys 100, nr elts 0 --4789-- N: 4 : nr chain 104, nr keys 84, nr elts 0 --4789-- N: 5 : nr chain 72, nr keys 42, nr elts 0 --4789-- N: 6 : nr chain 44, nr keys 34, nr elts 0 --4789-- N: 7 : nr chain 18, nr keys 13, nr elts 0 --4789-- N: 8 : nr chain 17, nr keys 8, nr elts 0 --4789-- N: 9 : nr chain 4, nr keys 6, nr elts 0 --4789-- N:10 : nr chain 9, nr keys 4, nr elts 0 --4789-- N:11 : nr chain 1, nr keys 0, nr elts 0 --4789-- N:13 : nr chain 1, nr keys 1, nr elts 0 --4789-- total nr of unique chains: 12289, keys 6928, elts 8174 which shows that on 8174 different strings, we have only 6410 strings which have a unique hash value. As other examples, N:13 line shows we have 13 strings mapping to the same key. N:14 line shows we have 4 groups of 10 strings mapping to the same key, etc. So, adler32 is definitely a bad hash function. Trials have been done with another hash function, giving a much lower collision rate. So, a better (but still fast) hash function would probably be beneficial. To be looked at ... git-svn-id: svn://svn.valgrind.org/valgrind/trunk@14029	2014-06-14 16:30:09 +00:00
Philippe Waroquiers	55c12b3a18	On a big application linking with gtk, using the compilation options -ffunction-sections -fdata-sections and the linker option -Wl,--gc-sections, --read-var-info=yes gives the following: valgrind: m_debuginfo/d3basics.c:973 (vgModuleLocal_evaluate_GX): Assertion 'aMax == ~(Addr)0' failed. host stacktrace: ==18521== at 0x38057C54: show_sched_status_wrk (m_libcassert.c:308) ==18521== by 0x38057F50: report_and_quit (m_libcassert.c:367) ==18521== by 0x38058151: vgPlain_assert_fail (m_libcassert.c:432) ==18521== by 0x3813F084: vgModuleLocal_evaluate_GX (d3basics.c:973) ==18521== by 0x38098300: data_address_is_in_var (debuginfo.c:2769) ==18521== by 0x38099E26: vgPlain_get_data_description (debuginfo.c:3298) ... The problem is that -Wl,--gc-sections eliminates the unused functions but keeps some debug info for the functions or their compilation units. The dwarf entry has low and high pc, but both are equal to 0. The dwarf reader of Valgrind is confused by this, as the varstack becomes empty, while it should not. This then causes local (eliminated) variables to be put in the global scope, leading afterwards to evaluation errors when describing any other variables. The fix is to also push something on the varstack when a CU that has low and high pc given but with 0 value. This is similar to the varstack_push done for a CU that has no low pc, no high pc and no range. Despite considerable effort to make a small reproducer, the problem could only be produced with a big executable. After the fix, everything was working properly. The wrong behaviour for dwarf entries produce the following trace: <2><2ff291a>: Abbrev Number: 23 (DW_TAG_formal_parameter) DW_AT_name : AET DW_AT_decl_file : 1 DW_AT_decl_line : 243 DW_AT_type : <2ff2811> DW_AT_location : 18288554 Recording this variable, with 1 PC range(s) .... <2ff291a> addVar: level 0: AET :: EdgeTableEntry* Loc=GX(final){[0x0,0x8]=50,[0x9,0x1d]=53,[0x1e,0x26]=51,[0x27,0x29]=53,[0x2a,0x2f]=51,[0x44,0x4a]=53,[0x4d,0x5e]=51,[0x5f,0x62]=53} FrB=none declared at: gdkpolyreg-generic.c:243 ACQUIRE for range(s) [0x0,0xffffffff] The AET is a formal parameter of a function, but is wrongly added at level 0, with a PC range covering the full space. It has a Loc GX which uses non biased program counters (e.g. 0x0,0x8). This dwarf entry will require a FrB (and registers when evaluating) but no such things are available (or given) when evaluating a variable in the global scope. The fix is to handle compilation units with lo and hi pc == 0x0 similarly to a CU that has no lo and hi pc. With this fix, valgrind --read-var-info=yes could properly handle a big application with plenty of eliminated functions. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13941	2014-05-07 21:09:16 +00:00
Philippe Waroquiers	0a4b0b50a8	For the following c program: main(int argc) { typedef struct { int before_name; char name[argc]; int after_name; } namet; namet n; } compiled with gcc 4.7.4, the trunk --read-var-info=yes gives: parse_type_DIE: confused by: <2><51>: DW_TAG_structure_type DW_AT_decl_file : 1 DW_AT_decl_line : 4 DW_AT_sibling : <83> This is because that dwarf entry defines a struct with no size. This happens when the struct has a VLA array in the middle of a struct. This is a C gcc extension, and is a standard feature of Ada. The proper solution would be to have the size calculated at runtime, using the gnat extensions or dwarf entries (to be generated by the compiler). The patch fixes this problem by defining the size of such structure as 1 byte. Another approach tried was to put the max possible size. This had the disadvantage that any address on the stack was seen as belonging to this variable. This allows the description to work for the 1st byte of the variable but cannot properly describe the 2nd and following bytes : (gdb) p &n $9 = (namet *) 0xbefbc070 (gdb) mo c d 0xbefbc070 Address 0xBEFBC070 len 1 not defined: Uninitialised value at 0xBEFBC070 ==1396== Location 0xbefbc070 is 0 bytes inside n.before_name, ==1396== declared at crec.c:10, in frame #0 of thread 1 (gdb) mo c d 0xbefbc071 Address 0xBEFBC071 len 1 not defined: Uninitialised value at 0xBEFBC071 ==1396== Address 0xbefbc071 is on thread 1's stack (gdb) A possible refinement would be to use a huge size but have the logic of variable description understanding this and describing all between this var and hte next var on the stack as being in the VLA variable. In the meantime, the size 1 avoids --read-var-info=yes to fail. Also, the 'goto bad_DIE' have been replaced by a macro goto_bad_DIE that ensures the line nr at which the bad DIE has been detected is reported in the error msg. This makes it easier to understand what is the problem. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13938	2014-05-06 20:15:55 +00:00
Dejan Jevtic	958717766b	mips64: Add an extra case for mips64 in ML_(get_CFA). This patch resolves the issue with the memcheck/tests/dw4 on mips64 platform. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13895	2014-04-14 11:54:36 +00:00
Dejan Jevtic	6cb9b78f0d	mips32/64: According to DWARF version 4 in DW_TAG_structure_type we can have DW_AT_signature attribute. That wasn't the case in DWARF version 3. From DWARF version 4: If the complete declaration of a type has been placed in a separate type unit, an incomplete declaration of that type in the compilation unit may provide the unique 64-bit signature of the type using a DW_AT_signature attribute. This patch adds an extra field in TyStOrUn structure (typeR). This field is reference to other TyEnt that is placed in separate type unit. Because of the new field in TyStOrUn structure we need to add an extra case in parse_type_DIE that will put the right reference to other TyEnt and an extra case in ML_(describe_type) that will describe type when the ty->Te.TyStOrUn.typeR field is used. This patch is resolving the problem with memcheck/tests/dw4 test when it's compiled with compiler that will emit DW_AT_signature under the DW_TAG_structure_type. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13891	2014-04-04 10:20:03 +00:00
Dejan Jevtic	eb40633b9b	mips32: Add an extra case for mips32 in ML_(get_CFA) in witch Valgrind will call compute_cfa to get the call frame address. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13890	2014-04-04 10:02:03 +00:00
Julian Seward	68a2a4ce01	Initial implementation of CFI based stack unwinding for arm64-linux. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13774	2014-01-13 00:21:09 +00:00
Julian Seward	3f6d211236	Add support for ARMv8 AArch64 (the 64 bit ARM instruction set). git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13770	2014-01-12 12:54:00 +00:00
Dejan Jevtic	423d0643b9	mips32: Adding mips32/Android support to Valgrind. Necessary changes to Valgrind to support mips32 on Android. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13767	2013-12-27 09:06:55 +00:00
Mark Wielaard	98a63bf1d4	Bug 327916 - DW_TAG_typedef may have no name We already accepted DW_TAG_typedef without a name for Ada. But g++ for OpenMP can also emit such nameless DW_TAG_typedefs. Just accept them. Also fix up anonymous enum and typedef printing in tytypes.c. git-svn-id: svn://svn.valgrind.org/valgrind/trunk@13718	2013-11-24 17:19:35 +00:00

1 2 3 4 5 ...

438 Commits