[BACK]Return to README CVS log [TXT][DIR] Up to [local] / OpenXM_contrib / gc

Annotation of OpenXM_contrib/gc/README, Revision 1.1.1.3

1.1       maekawa     1: Copyright 1988, 1989 Hans-J. Boehm, Alan J. Demers
                      2: Copyright (c) 1991-1996 by Xerox Corporation.  All rights reserved.
1.1.1.2   maekawa     3: Copyright (c) 1996-1999 by Silicon Graphics.  All rights reserved.
                      4: Copyright (c) 1999 by Hewlett-Packard Company. All rights reserved.
1.1       maekawa     5:
                      6: THIS MATERIAL IS PROVIDED AS IS, WITH ABSOLUTELY NO WARRANTY EXPRESSED
                      7: OR IMPLIED.  ANY USE IS AT YOUR OWN RISK.
                      8:
                      9: Permission is hereby granted to use or copy this program
                     10: for any purpose,  provided the above notices are retained on all copies.
                     11: Permission to modify the code and to distribute modified code is granted,
                     12: provided the above notices are retained, and a notice that the code was
                     13: modified is included with the above copyright notice.
                     14:
1.1.1.3 ! maekawa    15: This is version 5.3 of a conservative garbage collector for C and C++.
1.1       maekawa    16:
                     17: You might find a more recent version of this at
                     18:
1.1.1.2   maekawa    19: http://www.hpl.hp.com/personal/Hans_Boehm/gc
1.1       maekawa    20:
                     21: HISTORY -
                     22:
                     23:   Early versions of this collector were developed as a part of research
                     24: projects supported in part by the National Science Foundation
                     25: and the Defense Advance Research Projects Agency.
1.1.1.3 ! maekawa    26: Much of the code was rewritten by Hans-J. Boehm (boehm@acm.org) at Xerox PARC,
        !            27: SGI, and HP Labs.
1.1       maekawa    28:
                     29: Some other contributors:
                     30:
                     31: More recent contributors are mentioned in the modification history at the
                     32: end of this file.  My apologies for any omissions.
                     33:
                     34: The SPARC specific code was contributed by Mark Weiser
                     35: (weiser@parc.xerox.com).  The Encore Multimax modifications were supplied by
                     36: Kevin Kenny (kenny@m.cs.uiuc.edu).  The adaptation to the RT is largely due
                     37: to Vernon Lee (scorpion@rice.edu), on machines made available by IBM.
                     38: Much of the HP specific code and a number of good suggestions for improving the
                     39: generic code are due to Walter Underwood (wunder@hp-ses.sde.hp.com).
                     40: Robert Brazile (brazile@diamond.bbn.com) originally supplied the ULTRIX code.
                     41: Al Dosser (dosser@src.dec.com) and Regis Cridlig (Regis.Cridlig@cl.cam.ac.uk)
                     42: subsequently provided updates and information on variation between ULTRIX
                     43: systems.  Parag Patel (parag@netcom.com) supplied the A/UX code.
1.1.1.2   maekawa    44: Jesper Peterson(jep@mtiame.mtia.oz.au), Michel Schinz, and
                     45: Martin Tauchmann (martintauchmann@bigfoot.com) supplied the Amiga port.
1.1       maekawa    46: Thomas Funke (thf@zelator.in-berlin.de(?)) and
                     47: Brian D.Carlstrom (bdc@clark.lcs.mit.edu) supplied the NeXT ports.
                     48: Douglas Steel (doug@wg.icl.co.uk) provided ICL DRS6000 code.
                     49: Bill Janssen (janssen@parc.xerox.com) supplied the SunOS dynamic loader
                     50: specific code. Manuel Serrano (serrano@cornas.inria.fr) supplied linux and
                     51: Sony News specific code.  Al Dosser provided Alpha/OSF/1 code.  He and
                     52: Dave Detlefs(detlefs@src.dec.com) also provided several generic bug fixes.
                     53: Alistair G. Crooks(agc@uts.amdahl.com) supplied the NetBSD and 386BSD ports.
                     54: Jeffrey Hsu (hsu@soda.berkeley.edu) provided the FreeBSD port.
                     55: Brent Benson (brent@jade.ssd.csd.harris.com) ported the collector to
                     56: a Motorola 88K processor running CX/UX (Harris NightHawk).
                     57: Ari Huttunen (Ari.Huttunen@hut.fi) generalized the OS/2 port to
                     58: nonIBM development environments (a nontrivial task).
                     59: Patrick Beard (beard@cs.ucdavis.edu) provided the initial MacOS port.
                     60: David Chase, then at Olivetti Research, suggested several improvements.
                     61: Scott Schwartz (schwartz@groucho.cse.psu.edu) supplied some of the
                     62: code to save and print call stacks for leak detection on a SPARC.
                     63: Jesse Hull and John Ellis supplied the C++ interface code.
                     64: Zhong Shao performed much of the experimentation that led to the
                     65: current typed allocation facility.  (His dynamic type inference code hasn't
                     66: made it into the released version of the collector, yet.)
                     67: (Blame for misinstallation of these modifications goes to the first author,
                     68: however.)
                     69:
                     70: OVERVIEW
                     71:
                     72:     This is intended to be a general purpose, garbage collecting storage
                     73: allocator.  The algorithms used are described in:
                     74:
                     75: Boehm, H., and M. Weiser, "Garbage Collection in an Uncooperative Environment",
                     76: Software Practice & Experience, September 1988, pp. 807-820.
                     77:
                     78: Boehm, H., A. Demers, and S. Shenker, "Mostly Parallel Garbage Collection",
                     79: Proceedings of the ACM SIGPLAN '91 Conference on Programming Language Design
                     80: and Implementation, SIGPLAN Notices 26, 6 (June 1991), pp. 157-164.
                     81:
                     82: Boehm, H., "Space Efficient Conservative Garbage Collection", Proceedings
                     83: of the ACM SIGPLAN '91 Conference on Programming Language Design and
                     84: Implementation, SIGPLAN Notices 28, 6 (June 1993), pp. 197-206.
                     85:
                     86:   Possible interactions between the collector and optimizing compilers are
                     87: discussed in
                     88:
                     89: Boehm, H., and D. Chase, "A Proposal for GC-safe C Compilation",
                     90: The Journal of C Language Translation 4, 2 (December 1992).
                     91:
                     92: and
                     93:
                     94: Boehm H., "Simple GC-safe Compilation", Proceedings
                     95: of the ACM SIGPLAN '96 Conference on Programming Language Design and
                     96: Implementation.
                     97:
                     98: (Both are also available from
                     99: http://reality.sgi.com/boehm/papers/, among other places.)
                    100:
                    101:   Unlike the collector described in the second reference, this collector
                    102: operates either with the mutator stopped during the entire collection
                    103: (default) or incrementally during allocations.  (The latter is supported
                    104: on only a few machines.)  It does not rely on threads, but is intended
                    105: to be thread-safe.
                    106:
                    107:   Some of the ideas underlying the collector have previously been explored
                    108: by others.  (Doug McIlroy wrote a vaguely similar collector that is part of
                    109: version 8 UNIX (tm).)  However none of this work appears to have been widely
                    110: disseminated.
                    111:
                    112:   Rudimentary tools for use of the collector as a leak detector are included, as
                    113: is a fairly sophisticated string package "cord" that makes use of the collector.
                    114: (See cord/README.)
                    115:
                    116:
                    117: GENERAL DESCRIPTION
                    118:
                    119:   This is a garbage collecting storage allocator that is intended to be
                    120: used as a plug-in replacement for C's malloc.
                    121:
                    122:   Since the collector does not require pointers to be tagged, it does not
                    123: attempt to ensure that all inaccessible storage is reclaimed.  However,
                    124: in our experience, it is typically more successful at reclaiming unused
                    125: memory than most C programs using explicit deallocation.  Unlike manually
                    126: introduced leaks, the amount of unreclaimed memory typically stays
                    127: bounded.
                    128:
                    129:   In the following, an "object" is defined to be a region of memory allocated
                    130: by the routines described below.
                    131:
                    132:   Any objects not intended to be collected must be pointed to either
                    133: from other such accessible objects, or from the registers,
                    134: stack, data, or statically allocated bss segments.  Pointers from
                    135: the stack or registers may point to anywhere inside an object.
                    136: The same is true for heap pointers if the collector is compiled with
                    137:  ALL_INTERIOR_POINTERS defined, as is now the default.
                    138:
                    139: Compiling without ALL_INTERIOR_POINTERS may reduce accidental retention
                    140: of garbage objects, by requiring pointers from the heap to to the beginning
                    141: of an object.  But this no longer appears to be a significant
                    142: issue for most programs.
                    143:
                    144: There are a number of routines which modify the pointer recognition
                    145: algorithm.  GC_register_displacement allows certain interior pointers
                    146: to be recognized even if ALL_INTERIOR_POINTERS is nor defined.
                    147: GC_malloc_ignore_off_page allows some pointers into the middle of large objects
                    148: to be disregarded, greatly reducing the probablility of accidental
                    149: retention of large objects.  For most purposes it seems best to compile
                    150: with ALL_INTERIOR_POINTERS and to use GC_malloc_ignore_off_page if
                    151: you get collector warnings from allocations of very large objects.
                    152: See README.debugging for details.
                    153:
                    154:   Note that pointers inside memory allocated by the standard "malloc" are not
                    155: seen by the garbage collector.  Thus objects pointed to only from such a
                    156: region may be prematurely deallocated.  It is thus suggested that the
                    157: standard "malloc" be used only for memory regions, such as I/O buffers, that
                    158: are guaranteed not to contain pointers to garbage collectable memory.
                    159: Pointers in C language automatic, static, or register variables,
                    160: are correctly recognized.  (Note that GC_malloc_uncollectable has semantics
                    161: similar to standard malloc, but allocates objects that are traced by the
                    162: collector.)
                    163:
                    164:   The collector does not always know how to find pointers in data
                    165: areas that are associated with dynamic libraries.  This is easy to
                    166: remedy IF you know how to find those data areas on your operating
                    167: system (see GC_add_roots).  Code for doing this under SunOS, IRIX 5.X and 6.X,
                    168: HP/UX, Alpha OSF/1, Linux, and win32 is included and used by default.  (See
                    169: README.win32 for win32 details.)  On other systems pointers from dynamic
                    170: library data areas may not be considered by the collector.
                    171:
                    172:   Note that the garbage collector does not need to be informed of shared
                    173: read-only data.  However if the shared library mechanism can introduce
                    174: discontiguous data areas that may contain pointers, then the collector does
                    175: need to be informed.
                    176:
                    177:   Signal processing for most signals may be deferred during collection,
                    178: and during uninterruptible parts of the allocation process.  Unlike
                    179: standard ANSI C mallocs, it can be safe to invoke malloc
                    180: from a signal handler while another malloc is in progress, provided
                    181: the original malloc is not restarted.  (Empirically, many UNIX
                    182: applications already assume this.)  To obtain this level  of signal
                    183: safety, remove the definition of -DNO_SIGNALS in Makefile.  This incurs
                    184: a minor performance penalty, and hence is no longer the default.
                    185:
                    186:   The allocator/collector can also be configured for thread-safe operation.
                    187: (Full signal safety can also be achieved, but only at the cost of two system
                    188: calls per malloc, which is usually unacceptable.)
                    189:
                    190: INSTALLATION AND PORTABILITY
                    191:
                    192:   As distributed, the macro SILENT is defined in Makefile.
                    193: In the event of problems, this can be removed to obtain a moderate
                    194: amount of descriptive output for each collection.
                    195: (The given statistics exhibit a few peculiarities.
                    196: Things don't appear to add up for a variety of reasons, most notably
                    197: fragmentation losses.  These are probably much more significant for the
                    198: contrived program "test.c" than for your application.)
                    199:
                    200:   Note that typing "make test" will automatically build the collector
                    201: and then run setjmp_test and gctest. Setjmp_test will give you information
                    202: about configuring the collector, which is useful primarily if you have
                    203: a machine that's not already supported.  Gctest is a somewhat superficial
                    204: test of collector functionality.  Failure is indicated by a core dump or
                    205: a message to the effect that the collector is broken.  Gctest takes about
                    206: 35 seconds to run on a SPARCstation 2. On a slower machine,
                    207: expect it to take a while.  It may use up to 8 MB of memory.  (The
                    208: multi-threaded version will use more.)  "Make test" will also, as
                    209: its last step, attempt to build and test the "cord" string library.
                    210: This will fail without an ANSI C compiler.
                    211:
                    212:   The Makefile will generate a library gc.a which you should link against.
                    213: Typing "make cords" will add the cord library to gc.a.
                    214: Note that this requires an ANSI C compiler.
                    215:
                    216:   It is suggested that if you need to replace a piece of the collector
                    217: (e.g. GC_mark_rts.c) you simply list your version ahead of gc.a on the
                    218:                work.)
                    219: ld command line, rather than replacing the one in gc.a.  (This will
                    220: generate numerous warnings under some versions of AIX, but it still
                    221: works.)
                    222:
                    223:   All include files that need to be used by clients will be put in the
                    224: include subdirectory.  (Normally this is just gc.h.  "Make cords" adds
                    225: "cord.h" and "ec.h".)
                    226:
                    227:   The collector currently is designed to run essentially unmodified on
                    228: machines that use a flat 32-bit or 64-bit address space.
                    229: That includes the vast majority of Workstations and X86 (X >= 3) PCs.
                    230: (The list here was deleted because it was getting too long and constantly
                    231: out of date.)
                    232:   It does NOT run under plain 16-bit DOS or Windows 3.X.  There are however
                    233: various packages (e.g. win32s, djgpp) that allow flat 32-bit address
                    234: applications to run under those systemsif the have at least an 80386 processor,
                    235: and several of those are compatible with the collector.
                    236:
                    237:   In a few cases (Amiga, OS/2, Win32, MacOS) a separate makefile
                    238: or equivalent is supplied.  Many of these have separate README.system
                    239: files.
                    240:
                    241:   Dynamic libraries are completely supported only under SunOS
                    242: (and even that support is not functional on the last Sun 3 release),
                    243: IRIX 5&6, HP-PA, Win32 (not Win32S) and OSF/1 on DEC AXP machines.
                    244: On other machines we recommend that you do one of the following:
                    245:
                    246:   1) Add dynamic library support (and send us the code).
                    247:   2) Use static versions of the libraries.
                    248:   3) Arrange for dynamic libraries to use the standard malloc.
                    249:      This is still dangerous if the library stores a pointer to a
                    250:      garbage collected object.  But nearly all standard interfaces
                    251:      prohibit this, because they deal correctly with pointers
                    252:      to stack allocated objects.  (Strtok is an exception.  Don't
                    253:      use it.)
                    254:
                    255:   In all cases we assume that pointer alignment is consistent with that
                    256: enforced by the standard C compilers.  If you use a nonstandard compiler
                    257: you may have to adjust the alignment parameters defined in gc_priv.h.
                    258:
                    259:   A port to a machine that is not byte addressed, or does not use 32 bit
                    260: or 64 bit addresses will require a major effort.  A port to plain MSDOS
                    261: or win16 is hard.
                    262:
                    263:   For machines not already mentioned, or for nonstandard compilers, the
                    264: following are likely to require change:
                    265:
                    266: 1.  The parameters in gcconfig.h.
                    267:       The parameters that will usually require adjustment are
                    268:    STACKBOTTOM,  ALIGNMENT and DATASTART.  Setjmp_test
                    269:    prints its guesses of the first two.
                    270:       DATASTART should be an expression for computing the
                    271:    address of the beginning of the data segment.  This can often be
                    272:    &etext.  But some memory management units require that there be
                    273:    some unmapped space between the text and the data segment.  Thus
                    274:    it may be more complicated.   On UNIX systems, this is rarely
                    275:    documented.  But the adb "$m" command may be helpful.  (Note
                    276:    that DATASTART will usually be a function of &etext.  Thus a
                    277:    single experiment is usually insufficient.)
                    278:      STACKBOTTOM is used to initialize GC_stackbottom, which
                    279:    should be a sufficient approximation to the coldest stack address.
                    280:    On some machines, it is difficult to obtain such a value that is
                    281:    valid across a variety of MMUs, OS releases, etc.  A number of
                    282:    alternatives exist for using the collector in spite of this.  See the
                    283:    discussion in gcconfig.h immediately preceding the various
                    284:    definitions of STACKBOTTOM.
                    285:
                    286: 2.  mach_dep.c.
                    287:       The most important routine here is one to mark from registers.
                    288:     The distributed file includes a generic hack (based on setjmp) that
                    289:     happens to work on many machines, and may work on yours.  Try
                    290:     compiling and running setjmp_t.c to see whether it has a chance of
                    291:     working.  (This is not correct C, so don't blame your compiler if it
                    292:     doesn't work.  Based on limited experience, register window machines
                    293:     are likely to cause trouble.  If your version of setjmp claims that
                    294:     all accessible variables, including registers, have the value they
                    295:     had at the time of the longjmp, it also will not work.  Vanilla 4.2 BSD
                    296:     on Vaxen makes such a claim.  SunOS does not.)
                    297:       If your compiler does not allow in-line assembly code, or if you prefer
                    298:     not to use such a facility, mach_dep.c may be replaced by a .s file
                    299:     (as we did for the MIPS machine and the PC/RT).
                    300:       At this point enough architectures are supported by mach_dep.c
                    301:     that you will rarely need to do more than adjust for assembler
                    302:     syntax.
                    303:
                    304: 3.  os_dep.c (and gc_priv.h).
                    305:          Several kinds of operating system dependent routines reside here.
                    306:        Many are optional.  Several are invoked only through corresponding
                    307:        macros in gc_priv.h, which may also be redefined as appropriate.
                    308:       The routine GC_register_data_segments is crucial.  It registers static
                    309:     data areas that must be traversed by the collector. (User calls to
                    310:     GC_add_roots may sometimes be used for similar effect.)
                    311:       Routines to obtain memory from the OS also reside here.
                    312:     Alternatively this can be done entirely by the macro GET_MEM
                    313:     defined in gc_priv.h.  Routines to disable and reenable signals
                    314:     also reside here if they are need by the macros DISABLE_SIGNALS
                    315:     and ENABLE_SIGNALS defined in gc_priv.h.
                    316:       In a multithreaded environment, the macros LOCK and UNLOCK
                    317:     in gc_priv.h will need to be suitably redefined.
                    318:       The incremental collector requires page dirty information, which
                    319:     is acquired through routines defined in os_dep.c.  Unless directed
                    320:     otherwise by gcconfig.h, these are implemented as stubs that simply
                    321:     treat all pages as dirty.  (This of course makes the incremental
                    322:     collector much less useful.)
                    323:
                    324: 4.  dyn_load.c
                    325:        This provides a routine that allows the collector to scan data
                    326:        segments associated with dynamic libraries.  Often it is not
                    327:        necessary to provide this routine unless user-written dynamic
                    328:        libraries are used.
                    329:
                    330:   For a different version of UN*X or different machines using the
                    331: Motorola 68000, Vax, SPARC, 80386, NS 32000, PC/RT, or MIPS architecture,
                    332: it should frequently suffice to change definitions in gcconfig.h.
                    333:
                    334:
                    335: THE C INTERFACE TO THE ALLOCATOR
                    336:
                    337:   The following routines are intended to be directly called by the user.
                    338: Note that usually only GC_malloc is necessary.  GC_clear_roots and GC_add_roots
                    339: calls may be required if the collector has to trace from nonstandard places
                    340: (e.g. from dynamic library data areas on a machine on which the
                    341: collector doesn't already understand them.)  On some machines, it may
                    342: be desirable to set GC_stacktop to a good approximation of the stack base.
                    343: (This enhances code portability on HP PA machines, since there is no
                    344: good way for the collector to compute this value.)  Client code may include
                    345: "gc.h", which defines all of the following, plus many others.
                    346:
                    347: 1)  GC_malloc(nbytes)
                    348:     - allocate an object of size nbytes.  Unlike malloc, the object is
                    349:       cleared before being returned to the user.  Gc_malloc will
                    350:       invoke the garbage collector when it determines this to be appropriate.
                    351:       GC_malloc may return 0 if it is unable to acquire sufficient
                    352:       space from the operating system.  This is the most probable
                    353:       consequence of running out of space.  Other possible consequences
                    354:       are that a function call will fail due to lack of stack space,
                    355:       or that the collector will fail in other ways because it cannot
                    356:       maintain its internal data structures, or that a crucial system
                    357:       process will fail and take down the machine.  Most of these
                    358:       possibilities are independent of the malloc implementation.
                    359:
                    360: 2)  GC_malloc_atomic(nbytes)
                    361:     - allocate an object of size nbytes that is guaranteed not to contain any
                    362:       pointers.  The returned object is not guaranteed to be cleared.
                    363:       (Can always be replaced by GC_malloc, but results in faster collection
                    364:       times.  The collector will probably run faster if large character
                    365:       arrays, etc. are allocated with GC_malloc_atomic than if they are
                    366:       statically allocated.)
                    367:
                    368: 3)  GC_realloc(object, new_size)
                    369:     - change the size of object to be new_size.  Returns a pointer to the
                    370:       new object, which may, or may not, be the same as the pointer to
                    371:       the old object.  The new object is taken to be atomic iff the old one
                    372:       was.  If the new object is composite and larger than the original object,
                    373:       then the newly added bytes are cleared (we hope).  This is very likely
                    374:       to allocate a new object, unless MERGE_SIZES is defined in gc_priv.h.
                    375:       Even then, it is likely to recycle the old object only if the object
                    376:       is grown in small additive increments (which, we claim, is generally bad
                    377:       coding practice.)
                    378:
                    379: 4)  GC_free(object)
                    380:     - explicitly deallocate an object returned by GC_malloc or
                    381:       GC_malloc_atomic.  Not necessary, but can be used to minimize
                    382:       collections if performance is critical.  Probably a performance
                    383:       loss for very small objects (<= 8 bytes).
                    384:
                    385: 5)  GC_expand_hp(bytes)
                    386:     - Explicitly increase the heap size.  (This is normally done automatically
                    387:       if a garbage collection failed to GC_reclaim enough memory.  Explicit
                    388:       calls to GC_expand_hp may prevent unnecessarily frequent collections at
                    389:       program startup.)
                    390:
                    391: 6)  GC_malloc_ignore_off_page(bytes)
                    392:        - identical to GC_malloc, but the client promises to keep a pointer to
                    393:          the somewhere within the first 256 bytes of the object while it is
                    394:          live.  (This pointer should nortmally be declared volatile to prevent
                    395:          interference from compiler optimizations.)  This is the recommended
                    396:          way to allocate anything that is likely to be larger than 100Kbytes
                    397:          or so.  (GC_malloc may result in failure to reclaim such objects.)
                    398:
                    399: 7)  GC_set_warn_proc(proc)
                    400:        - Can be used to redirect warnings from the collector.  Such warnings
                    401:          should be rare, and should not be ignored during code development.
                    402:
                    403: 8) GC_enable_incremental()
                    404:     - Enables generational and incremental collection.  Useful for large
                    405:       heaps on machines that provide access to page dirty information.
                    406:       Some dirty bit implementations may interfere with debugging
                    407:       (by catching address faults) and place restrictions on heap arguments
                    408:       to system calls (since write faults inside a system call may not be
                    409:       handled well).
                    410:
                    411: 9) Several routines to allow for registration of finalization code.
                    412:    User supplied finalization code may be invoked when an object becomes
                    413:    unreachable.  To call (*f)(obj, x) when obj becomes inaccessible, use
                    414:        GC_register_finalizer(obj, f, x, 0, 0);
                    415:    For more sophisticated uses, and for finalization ordering issues,
                    416:    see gc.h.
                    417:
                    418:   The global variable GC_free_space_divisor may be adjusted up from its
                    419: default value of 4 to use less space and more collection time, or down for
                    420: the opposite effect.  Setting it to 1 or 0 will effectively disable collections
                    421: and cause all allocations to simply grow the heap.
                    422:
                    423:   The variable GC_non_gc_bytes, which is normally 0, may be changed to reflect
                    424: the amount of memory allocated by the above routines that should not be
                    425: considered as a candidate for collection.  Careless use may, of course, result
                    426: in excessive memory consumption.
                    427:
                    428:   Some additional tuning is possible through the parameters defined
                    429: near the top of gc_priv.h.
                    430:
                    431:   If only GC_malloc is intended to be used, it might be appropriate to define:
                    432:
                    433: #define malloc(n) GC_malloc(n)
                    434: #define calloc(m,n) GC_malloc((m)*(n))
                    435:
                    436:   For small pieces of VERY allocation intensive code, gc_inl.h
                    437: includes some allocation macros that may be used in place of GC_malloc
                    438: and friends.
                    439:
                    440:   All externally visible names in the garbage collector start with "GC_".
                    441: To avoid name conflicts, client code should avoid this prefix, except when
                    442: accessing garbage collector routines or variables.
                    443:
                    444:   There are provisions for allocation with explicit type information.
                    445: This is rarely necessary.  Details can be found in gc_typed.h.
                    446:
                    447: THE C++ INTERFACE TO THE ALLOCATOR:
                    448:
                    449:   The Ellis-Hull C++ interface to the collector is included in
                    450: the collector distribution.  If you intend to use this, type
                    451: "make c++" after the initial build of the collector is complete.
                    452: See gc_cpp.h for the definition of the interface.  This interface
                    453: tries to approximate the Ellis-Detlefs C++ garbage collection
                    454: proposal without compiler changes.
                    455:
                    456: Cautions:
                    457: 1. Arrays allocated without new placement syntax are
                    458: allocated as uncollectable objects.  They are traced by the
                    459: collector, but will not be reclaimed.
                    460:
                    461: 2. Failure to use "make c++" in combination with (1) will
                    462: result in arrays allocated using the default new operator.
                    463: This is likely to result in disaster without linker warnings.
                    464:
                    465: 3. If your compiler supports an overloaded new[] operator,
                    466: then gc_cpp.cc and gc_cpp.h should be suitably modified.
                    467:
                    468: 4. Many current C++ compilers have deficiencies that
                    469: break some of the functionality.  See the comments in gc_cpp.h
                    470: for suggested workarounds.
                    471:
                    472: USE AS LEAK DETECTOR:
                    473:
                    474:   The collector may be used to track down leaks in C programs that are
                    475: intended to run with malloc/free (e.g. code with extreme real-time or
                    476: portability constraints).  To do so define FIND_LEAK in Makefile
                    477: This will cause the collector to invoke the report_leak
                    478: routine defined near the top of reclaim.c whenever an inaccessible
1.1.1.3 ! maekawa   479: object is found that has not been explicitly freed.  Such objects will
        !           480: also be automatically reclaimed.
1.1       maekawa   481:   Productive use of this facility normally involves redefining report_leak
                    482: to do something more intelligent.  This typically requires annotating
                    483: objects with additional information (e.g. creation time stack trace) that
                    484: identifies their origin.  Such code is typically not very portable, and is
                    485: not included here, except on SPARC machines.
                    486:   If all objects are allocated with GC_DEBUG_MALLOC (see next section),
                    487: then the default version of report_leak will report the source file
                    488: and line number at which the leaked object was allocated.  This may
                    489: sometimes be sufficient.  (On SPARC/SUNOS4 machines, it will also report
                    490: a cryptic stack trace.  This can often be turned into a sympolic stack
                    491: trace by invoking program "foo" with "callprocs foo".  Callprocs is
                    492: a short shell script that invokes adb to expand program counter values
                    493: to symbolic addresses.  It was largely supplied by Scott Schwartz.)
                    494:   Note that the debugging facilities described in the next section can
                    495: sometimes be slightly LESS effective in leak finding mode, since in
                    496: leak finding mode, GC_debug_free actually results in reuse of the object.
                    497: (Otherwise the object is simply marked invalid.)  Also note that the test
                    498: program is not designed to run meaningfully in FIND_LEAK mode.
                    499: Use "make gc.a" to build the collector.
                    500:
                    501: DEBUGGING FACILITIES:
                    502:
                    503:   The routines GC_debug_malloc, GC_debug_malloc_atomic, GC_debug_realloc,
                    504: and GC_debug_free provide an alternate interface to the collector, which
                    505: provides some help with memory overwrite errors, and the like.
                    506: Objects allocated in this way are annotated with additional
                    507: information.  Some of this information is checked during garbage
                    508: collections, and detected inconsistencies are reported to stderr.
                    509:
                    510:   Simple cases of writing past the end of an allocated object should
                    511: be caught if the object is explicitly deallocated, or if the
                    512: collector is invoked while the object is live.  The first deallocation
                    513: of an object will clear the debugging info associated with an
                    514: object, so accidentally repeated calls to GC_debug_free will report the
                    515: deallocation of an object without debugging information.  Out of
                    516: memory errors will be reported to stderr, in addition to returning
                    517: NIL.
                    518:
                    519:   GC_debug_malloc checking  during garbage collection is enabled
                    520: with the first call to GC_debug_malloc.  This will result in some
                    521: slowdown during collections.  If frequent heap checks are desired,
                    522: this can be achieved by explicitly invoking GC_gcollect, e.g. from
                    523: the debugger.
                    524:
                    525:   GC_debug_malloc allocated objects should not be passed to GC_realloc
                    526: or GC_free, and conversely.  It is however acceptable to allocate only
                    527: some objects with GC_debug_malloc, and to use GC_malloc for other objects,
                    528: provided the two pools are kept distinct.  In this case, there is a very
                    529: low probablility that GC_malloc allocated objects may be misidentified as
                    530: having been overwritten.  This should happen with probability at most
                    531: one in 2**32.  This probability is zero if GC_debug_malloc is never called.
                    532:
                    533:   GC_debug_malloc, GC_malloc_atomic, and GC_debug_realloc take two
                    534: additional trailing arguments, a string and an integer.  These are not
                    535: interpreted by the allocator.  They are stored in the object (the string is
                    536: not copied).  If an error involving the object is detected, they are printed.
                    537:
                    538:   The macros GC_MALLOC, GC_MALLOC_ATOMIC, GC_REALLOC, GC_FREE, and
                    539: GC_REGISTER_FINALIZER are also provided.  These require the same arguments
                    540: as the corresponding (nondebugging) routines.  If gc.h is included
                    541: with GC_DEBUG defined, they call the debugging versions of these
                    542: functions, passing the current file name and line number as the two
                    543: extra arguments, where appropriate.  If gc.h is included without GC_DEBUG
                    544: defined, then all these macros will instead be defined to their nondebugging
                    545: equivalents.  (GC_REGISTER_FINALIZER is necessary, since pointers to
                    546: objects with debugging information are really pointers to a displacement
                    547: of 16 bytes form the object beginning, and some translation is necessary
                    548: when finalization routines are invoked.  For details, about what's stored
                    549: in the header, see the definition of the type oh in debug_malloc.c)
                    550:
                    551: INCREMENTAL/GENERATIONAL COLLECTION:
                    552:
                    553: The collector normally interrupts client code for the duration of
                    554: a garbage collection mark phase.  This may be unacceptable if interactive
                    555: response is needed for programs with large heaps.  The collector
                    556: can also run in a "generational" mode, in which it usually attempts to
                    557: collect only objects allocated since the last garbage collection.
                    558: Furthermore, in this mode, garbage collections run mostly incrementally,
                    559: with a small amount of work performed in response to each of a large number of
                    560: GC_malloc requests.
                    561:
                    562: This mode is enabled by a call to GC_enable_incremental().
                    563:
                    564: Incremental and generational collection is effective in reducing
                    565: pause times only if the collector has some way to tell which objects
                    566: or pages have been recently modified.  The collector uses two sources
                    567: of information:
                    568:
                    569: 1. Information provided by the VM system.  This may be provided in
                    570: one of several forms.  Under Solaris 2.X (and potentially under other
                    571: similar systems) information on dirty pages can be read from the
                    572: /proc file system.  Under other systems (currently SunOS4.X) it is
                    573: possible to write-protect the heap, and catch the resulting faults.
                    574: On these systems we require that system calls writing to the heap
                    575: (other than read) be handled specially by client code.
                    576: See os_dep.c for details.
                    577:
                    578: 2. Information supplied by the programmer.  We define "stubborn"
                    579: objects to be objects that are rarely changed.  Such an object
                    580: can be allocated (and enabled for writing) with GC_malloc_stubborn.
                    581: Once it has been initialized, the collector should be informed with
                    582: a call to GC_end_stubborn_change.  Subsequent writes that store
                    583: pointers into the object must be preceded by a call to
                    584: GC_change_stubborn.
                    585:
                    586: This mechanism performs best for objects that are written only for
                    587: initialization, and such that only one stubborn object is writable
                    588: at once.  It is typically not worth using for short-lived
                    589: objects.  Stubborn objects are treated less efficiently than pointerfree
                    590: (atomic) objects.
                    591:
                    592: A rough rule of thumb is that, in the absence of VM information, garbage
                    593: collection pauses are proportional to the amount of pointerful storage
                    594: plus the amount of modified "stubborn" storage that is reachable during
                    595: the collection.
                    596:
                    597: Initial allocation of stubborn objects takes longer than allocation
                    598: of other objects, since other data structures need to be maintained.
                    599:
                    600: We recommend against random use of stubborn objects in client
                    601: code, since bugs caused by inappropriate writes to stubborn objects
                    602: are likely to be very infrequently observed and hard to trace.
                    603: However, their use may be appropriate in a few carefully written
                    604: library routines that do not make the objects themselves available
                    605: for writing by client code.
                    606:
                    607:
                    608: BUGS:
                    609:
                    610:   Any memory that does not have a recognizable pointer to it will be
                    611: reclaimed.  Exclusive-or'ing forward and backward links in a list
                    612: doesn't cut it.
                    613:   Some C optimizers may lose the last undisguised pointer to a memory
                    614: object as a consequence of clever optimizations.  This has almost
1.1.1.2   maekawa   615: never been observed in practice.  Send mail to boehm@acm.org
1.1       maekawa   616: for suggestions on how to fix your compiler.
                    617:   This is not a real-time collector.  In the standard configuration,
                    618: percentage of time required for collection should be constant across
                    619: heap sizes.  But collection pauses will increase for larger heaps.
                    620: (On SPARCstation 2s collection times will be on the order of 300 msecs
                    621: per MB of accessible memory that needs to be scanned.  Your mileage
                    622: may vary.)  The incremental/generational collection facility helps,
                    623: but is portable only if "stubborn" allocation is used.
1.1.1.2   maekawa   624:   Please address bug reports to boehm@acm.org.  If you are
1.1       maekawa   625: contemplating a major addition, you might also send mail to ask whether
                    626: it's already been done (or whether we tried and discarded it).
                    627:
                    628: RECENT VERSIONS:
                    629:
                    630:   Version 1.3 and immediately preceding versions contained spurious
                    631: assembly language assignments to TMP_SP.  Only the assignment in the PC/RT
                    632: code is necessary.  On other machines, with certain compiler options,
                    633: the assignments can lead to an unsaved register being overwritten.
                    634: Known to cause problems under SunOS 3.5 WITHOUT the -O option.  (With
                    635: -O the compiler recognizes it as dead code.  It probably shouldn't,
                    636: but that's another story.)
                    637:
                    638:   Version 1.4 and earlier versions used compile time determined values
                    639: for the stack base.  This no longer works on Sun 3s, since Sun 3/80s use
                    640: a different stack base.  We now use a straightforward heuristic on all
                    641: machines on which it is known to work (incl. Sun 3s) and compile-time
                    642: determined values for the rest.  There should really be library calls
                    643: to determine such values.
                    644:
                    645:   Version 1.5 and earlier did not ensure 8 byte alignment for objects
                    646: allocated on a sparc based machine.
                    647:
                    648:   Version 1.8 added ULTRIX support in gc_private.h.
                    649:
                    650:   Version 1.9 fixed a major bug in gc_realloc.
                    651:
                    652:   Version 2.0 introduced a consistent naming convention for collector
                    653: routines and added support for registering dynamic library data segments
                    654: in the standard mark_roots.c.  Most of the data structures were revamped.
                    655: The treatment of interior pointers was completely changed.  Finalization
                    656: was added.  Support for locking was added.  Object kinds were added.
                    657: We added a black listing facility to avoid allocating at addresses known
                    658: to occur as integers somewhere in the address space.  Much of this
                    659: was accomplished by adapting ideas and code from the PCR collector.
                    660: The test program was changed and expanded.
                    661:
                    662:   Version 2.1 was the first stable version since 1.9, and added support
                    663: for PPCR.
                    664:
                    665:   Version 2.2 added debugging allocation, and fixed various bugs.  Among them:
                    666: - GC_realloc could fail to extend the size of the object for certain large object sizes.
                    667: - A blatant subscript range error in GC_printf, which unfortunately
                    668:   wasn't exercised on machines with sufficient stack alignment constraints.
                    669: - GC_register_displacement did the wrong thing if it was called after
                    670:   any allocation had taken place.
                    671: - The leak finding code would eventually break after 2048 byte
                    672:   byte objects leaked.
                    673: - interface.c didn't compile.
                    674: - The heap size remained much too small for large stacks.
                    675: - The stack clearing code behaved badly for large stacks, and perhaps
                    676:   on HP/PA machines.
                    677:
                    678:   Version 2.3 added ALL_INTERIOR_POINTERS and fixed the following bugs:
                    679: - Missing declaration of etext in the A/UX version.
                    680: - Some PCR root-finding problems.
                    681: - Blacklisting was not 100% effective, because the plausible future
                    682:   heap bounds were being miscalculated.
                    683: - GC_realloc didn't handle out-of-memory correctly.
                    684: - GC_base could return a nonzero value for addresses inside free blocks.
                    685: - test.c wasn't really thread safe, and could erroneously report failure
                    686:   in a multithreaded environment.  (The locking primitives need to be
                    687:   replaced for other threads packages.)
                    688: - GC_CONS was thoroughly broken.
                    689: - On a SPARC with dynamic linking, signals stayed diabled while the
                    690:   client code was running.
                    691:   (Thanks to Manuel Serrano at INRIA for reporting the last two.)
                    692:
                    693:   Version 2.4 added GC_free_space_divisor as a tuning knob, added
                    694:   support for OS/2 and linux, and fixed the following bugs:
                    695: - On machines with unaligned pointers (e.g. Sun 3), every 128th word could
                    696:   fail to be considered for marking.
                    697: - Dynamic_load.c erroneously added 4 bytes to the length of the data and
                    698:   bss sections of the dynamic library.  This could result in a bad memory
                    699:   reference if the actual length was a multiple of a page.  (Observed on
                    700:   Sun 3.  Can probably also happen on a Sun 4.)
                    701:   (Thanks to Robert Brazile for pointing out that the Sun 3 version
                    702:   was broken.  Dynamic library handling is still broken on Sun 3s
                    703:   under 4.1.1U1, but apparently not 4.1.1.  If you have such a machine,
                    704:   use -Bstatic.)
                    705:
                    706:   Version 2.5 fixed the following bugs:
                    707: - Removed an explicit call to exit(1)
                    708: - Fixed calls to GC_printf and GC_err_printf, so the correct number of
                    709:   arguments are always supplied.  The OS/2 C compiler gets confused if
                    710:   the number of actuals and the number of formals differ.  (ANSI C
                    711:   doesn't require this to work.  The ANSI sanctioned way of doing things
                    712:   causes too many compatibility problems.)
                    713:
                    714:   Version 3.0  added generational/incremental collection and stubborn
                    715:   objects.
                    716:
                    717:   Version 3.1 added the following features:
                    718: - A workaround for a SunOS 4.X SPARC C compiler
                    719:   misfeature that caused problems when the collector was turned into
                    720:   a dynamic library.
                    721: - A fix for a bug in GC_base that could result in a memory fault.
                    722: - A fix for a performance bug (and several other misfeatures) pointed
                    723:   out by Dave Detlefs and Al Dosser.
                    724: - Use of dirty bit information for static data under Solaris 2.X.
                    725: - DEC Alpha/OSF1 support (thanks to Al Dosser).
                    726: - Incremental collection on more platforms.
                    727: - A more refined heap expansion policy.  Less space usage by default.
                    728: - Various minor enhancements to reduce space usage, and to reduce
                    729:   the amount of memory scanned by the collector.
                    730: - Uncollectable allocation without per object overhead.
                    731: - More conscientious handling of out-of-memory conditions.
                    732: - Fixed a bug in debugging stubborn allocation.
                    733: - Fixed a bug that resulted in occasional erroneous reporting of smashed
                    734:   objects with debugging allocation.
                    735: - Fixed bogus leak reports of size 4096 blocks with FIND_LEAK.
                    736:
                    737:   Version 3.2 fixed a serious and not entirely repeatable bug in
                    738:   the incremental collector.  It appeared only when dirty bit info
                    739:   on the roots was available, which is normally only under Solaris.
                    740:   It also added GC_general_register_disappearing_link, and some
                    741:   testing code.  Interface.c disappeared.
                    742:
                    743:   Version 3.3 fixes several bugs and adds new ports:
                    744: - PCR-specific bugs.
                    745: - Missing locking in GC_free, redundant FASTUNLOCK
                    746:   in GC_malloc_stubborn, and 2 bugs in
                    747:   GC_unregister_disappearing_link.
                    748:   All of the above were pointed out by Neil Sharman
                    749:   (neil@cs.mu.oz.au).
                    750: - Common symbols allocated by the SunOS4.X dynamic loader
                    751:   were not included in the root set.
                    752: - Bug in GC_finalize (reported by Brian Beuning and Al Dosser)
                    753: - Merged Amiga port from Jesper Peterson (untested)
                    754: - Merged NeXT port from Thomas Funke (significantly
                    755:   modified and untested)
                    756:
                    757:   Version 3.4:
                    758: - Fixed a performance bug in GC_realloc.
                    759: - Updated the amiga port.
                    760: - Added NetBSD and 386BSD ports.
                    761: - Added cord library.
                    762: - Added trivial performance enhancement for
                    763:   ALL_INTERIOR_POINTERS.  (Don't scan last word.)
                    764:
                    765:   Version 3.5
                    766: - Minor collections now mark from roots only once, if that
                    767:   doesn't cause an excessive pause.
                    768: - The stack clearing heuristic was refined to prevent anomalies
                    769:   with very heavily recursive programs and sparse stacks.
                    770: - Fixed a bug that prevented mark stack growth in some cases.
                    771:   GC_objects_are_marked should be set to TRUE after a call
                    772:   to GC_push_roots and as part of GC_push_marked, since
                    773:   both can now set mark bits.  I think this is only a performance
                    774:   bug, but I wouldn't bet on it.  It's certainly very hard to argue
                    775:   that the old version was correct.
                    776: - Fixed an incremental collection bug that prevented it from
                    777:   working at all when HBLKSIZE != getpagesize()
                    778: - Changed dynamic_loading.c to include gc_priv.h before testing
                    779:   DYNAMIC_LOADING.  SunOS dynamic library scanning
                    780:   must have been broken in 3.4.
                    781: - Object size rounding now adapts to program behavior.
                    782: - Added a workaround (provided by Manuel Serrano and
                    783:   colleagues) to a long-standing SunOS 4.X (and 3.X?) ld bug
                    784:   that I had incorrectly assumed to have been squished.
                    785:   The collector was broken if the text segment size was within
                    786:   32 bytes of a multiple of 8K bytes, and if the beginning of
                    787:   the data segment contained interesting roots.  The workaround
                    788:   assumes a demand-loadable executable.  The original may have
                    789:   have "worked" in some other cases.
                    790: - Added dynamic library support under IRIX5.
                    791: - Added support for EMX under OS/2 (thanks to Ari Huttunen).
                    792:
                    793: Version 3.6:
                    794: - fixed a bug in the mark stack growth code that was introduced
                    795:   in 3.4.
                    796: - fixed Makefile to work around DEC AXP compiler tail recursion
                    797:   bug.
                    798:
                    799: Version 3.7:
                    800: - Added a workaround for an HP/UX compiler bug.
                    801: - Fixed another stack clearing performance bug.  Reworked
                    802:   that code once more.
                    803:
                    804: Version 4.0:
                    805: - Added support for Solaris threads (which was possible
                    806:   only by reimplementing some fraction of Solaris threads,
                    807:   since Sun doesn't currently make the thread debugging
                    808:   interface available).
                    809: - Added non-threads win32 and win32S support.
                    810: - (Grudgingly, with suitable muttering of obscenities) renamed
                    811:   files so that the collector distribution could live on a FAT
                    812:   file system.  Files that are guaranteed to be useless on
                    813:   a PC still have long names.  Gc_inline.h and gc_private.h
                    814:   still exist, but now just include  gc_inl.h and gc_priv.h.
                    815: - Fixed a really obscure bug in finalization that could cause
                    816:   undetected mark stack overflows.  (I would be surprised if
                    817:   any real code ever tickled this one.)
                    818: - Changed finalization code to dynamically resize the hash
                    819:   tables it maintains.  (This probably does not matter for well-
                    820:   -written code.  It no doubt does for C++ code that overuses
                    821:   destructors.)
                    822: - Added typed allocation primitives.  Rewrote the marker to
                    823:   accommodate them with more reasonable efficiency.  This
                    824:   change should also speed up marking for GC_malloc allocated
                    825:   objects a little.  See gc_typed.h for new primitives.
                    826: - Improved debugging facilities slightly.  Allocation time
                    827:   stack traces are now kept by default on SPARC/SUNOS4.
                    828:   (Thanks to Scott Schwartz.)
                    829: - Added better support for small heap applications.
                    830: - Significantly extended cord package.  Fixed a bug in the
                    831:   implementation of lazily read files.  Printf and friends now
                    832:   have cord variants.  Cord traversals are a bit faster.
                    833: - Made ALL_INTERIOR_POINTERS recognition the default.
                    834: - Fixed de so that it can run in constant space, independent
                    835:   of file size.  Added simple string searching to cords and de.
                    836: - Added the Hull-Ellis C++ interface.
                    837: - Added dynamic library support for OSF/1.
                    838:   (Thanks to Al Dosser and Tim Bingham at DEC.)
                    839: - Changed argument to GC_expand_hp to be expressed
                    840:   in units of bytes instead of heap blocks.  (Necessary
                    841:   since the heap block size now varies depending on
                    842:   configuration.  The old version was never very clean.)
                    843: - Added GC_get_heap_size().  The previous "equivalent"
                    844:   was broken.
                    845: - Restructured the Makefile a bit.
                    846:
                    847: Since version 4.0:
                    848: - Changed finalization implementation to guarantee that
                    849:   finalization procedures are called outside of the allocation
                    850:   lock, making direct use of the interface a little less dangerous.
                    851:   MAY BREAK EXISTING CLIENTS that assume finalizers
                    852:   are protected by a lock.  Since there seem to be few multithreaded
                    853:   clients that use finalization, this is hopefully not much of
                    854:   a problem.
                    855: - Fixed a gross bug in CORD_prev.
                    856: - Fixed a bug in blacklst.c that could result in unbounded
                    857:   heap growth during startup on machines that do not clear
                    858:   memory obtained from the OS (e.g. win32S).
                    859: - Ported de editor to win32/win32S.  (This is now the only
                    860:   version with a mouse-sensitive UI.)
                    861: - Added GC_malloc_ignore_off_page to allocate large arrays
                    862:   in the presence of ALL_INTERIOR_POINTERS.
                    863: - Changed GC_call_with_alloc_lock to not disable signals in
                    864:   the single-threaded case.
                    865: - Reduced retry count in GC_collect_or_expand for garbage
                    866:   collecting when out of memory.
                    867: - Made uncollectable allocations bypass black-listing, as they
                    868:   should.
                    869: - Fixed a bug in typed_test in test.c that could cause (legitimate)
                    870:   GC crashes.
                    871: - Fixed some potential synchronization problems in finalize.c
                    872: - Fixed a real locking problem in typd_mlc.c.
                    873: - Worked around an AIX 3.2 compiler feature that results in
                    874:   out of bounds memory references.
                    875: - Partially worked around an IRIX5.2 beta problem (which may
                    876:   or may not persist to the final release).
                    877: - Fixed a bug in the heap integrity checking code that could
                    878:   result in explicitly deallocated objects being identified as
                    879:   smashed.  Fixed a bug in the dbg_mlc stack saving code
                    880:   that caused old argument pointers to be considered live.
                    881: - Fixed a bug in CORD_ncmp (and hence CORD_str).
                    882: - Repaired the OS2 port, which had suffered from bit rot
                    883:   in 4.0.  Worked around what appears to be CSet/2 V1.0
                    884:   optimizer bug.
                    885: - Fixed a Makefile bug for target "c++".
                    886:
                    887: Since version 4.1:
                    888: - Multiple bug fixes/workarounds in the Solaris threads version.
                    889:   (It occasionally failed to locate some register contents for
                    890:   marking.  It also turns out that thr_suspend and friends are
                    891:   unreliable in Solaris 2.3.  Dirty bit reads appear
                    892:   to be unreliable under some weird
                    893:   circumstances.  My stack marking code
                    894:   contained a serious performance bug.  The new code is
                    895:   extremely defensive, and has not failed in several cpu
                    896:   hours of testing.  But  no guarantees ...)
                    897: - Added MacOS support (thanks to Patrick Beard.)
                    898: - Fixed several syntactic bugs in gc_c++.h and friends.  (These
                    899:   didn't bother g++, but did bother most other compilers.)
                    900:   Fixed gc_c++.h finalization interface.  (It didn't.)
                    901: - 64 bit alignment for allocated objects was not guaranteed in a
                    902:   few cases in which it should have been.
                    903: - Added GC_malloc_atomic_ignore_off_page.
                    904: - Added GC_collect_a_little.
                    905: - Added some prototypes to gc.h.
                    906: - Some other minor bug fixes (notably in Makefile).
                    907: - Fixed OS/2 / EMX port (thanks to Ari Huttunen).
                    908: - Fixed AmigaDOS port. (thanks to Michel Schinz).
                    909: - Fixed the DATASTART definition under Solaris.  There
                    910:   was a 1 in 16K chance of the collector missing the first
                    911:   64K of static data (and thus crashing).
                    912: - Fixed some blatant anachronisms in the README file.
                    913: - Fixed PCR-Makefile for upcoming PPCR release.
                    914:
                    915: Since version 4.2:
                    916: - Fixed SPARC alignment problem with GC_DEBUG.
                    917: - Fixed Solaris threads /proc workaround.  The real
                    918:   problem was an interaction with mprotect.
                    919: - Incorporated fix from Patrick Beard for gc_c++.h (now gc_cpp.h).
                    920: - Slightly improved allocator space utilization by
                    921:   fixing the GC_size_map mechanism.
                    922: - Integrated some Sony News and MIPS RISCos 4.51
                    923:   patches.  (Thanks to Nobuyuki Hikichi of
                    924:   Software Research Associates, Inc. Japan)
                    925: - Fixed HP_PA alignment problem.  (Thanks to
                    926:   xjam@cork.cs.berkeley.edu.)
                    927: - Added GC_same_obj and friends.  Changed GC_base
                    928:   to return 0 for pointers past the end of large objects.
                    929:   Improved GC_base performance with ALL_INTERIOR_POINTERS
                    930:   on machines with a slow integer mod operation.
                    931:   Added GC_PTR_ADD, GC_PTR_STORE, etc. to prepare
                    932:   for preprocessor.
                    933: - changed the default on most UNIX machines to be that
                    934:   signals are not disabled during critical GC operations.
                    935:   This is still ANSI-conforming, though somewhat dangerous
                    936:   in the presence of signal handlers. But the performance
                    937:   cost of the alternative is sometimes problematic.
                    938:   Can be changed back with a minor Makefile edit.
                    939: - renamed IS_STRING in gc.h, to CORD_IS_STRING, thus
                    940:   following my own naming convention.  Added the function
                    941:   CORD_to_const_char_star.
                    942: - Fixed a gross bug in GC_finalize.  Symptom: occasional
                    943:   address faults in that function.  (Thanks to Anselm
                    944:   Baird-Smith (Anselm.BairdSmith@inria.fr)
                    945: - Added port to ICL DRS6000 running DRS/NX.  Restructured
                    946:   things a bit to factor out common code, and remove obsolete
                    947:   code.  Collector should now run under SUNOS5 with either
                    948:   mprotect or /proc dirty bits.  (Thanks to Douglas Steel
                    949:   (doug@wg.icl.co.uk)).
                    950: - More bug fixes and workarounds for Solaris 2.X.  (These were
                    951:   mostly related to putting the collector in a dynamic library,
                    952:   which didn't really work before.  Also SOLARIS_THREADS
                    953:   didn't interact well with dl_open.)  Thanks to btlewis@eng.sun.com.
                    954: - Fixed a serious performance bug on the DEC Alpha.  The text
                    955:   segment was getting registered as part of the root set.
                    956:   (Amazingly, the result was still fast enough that the bug
                    957:   was not conspicuous.) The fix works on OSF/1, version 1.3.
                    958:   Hopefully it also works on other versions of OSF/1 ...
                    959: - Fixed a bug in GC_clear_roots.
                    960: - Fixed a bug in GC_generic_malloc_words_small that broke
                    961:   gc_inl.h.  (Reported by Antoine de Maricourt.  I broke it
                    962:   in trying to tweak the Mac port.)
                    963: - Fixed some problems with cord/de under Linux.
                    964: - Fixed some cord problems, notably with CORD_riter4.
                    965: - Added DG/UX port.
                    966:   Thanks to Ben A. Mesander (ben@piglet.cr.usgs.gov)
                    967: - Added finalization registration routines with weaker ordering
                    968:   constraints.  (This is necessary for C++ finalization with
                    969:   multiple inheritance, since the compiler often adds self-cycles.)
                    970: - Filled the holes in the SCO port. (Thanks to Michael Arnoldus
                    971:   <chime@proinf.dk>.)
                    972: - John Ellis' additions to the C++ support:  From John:
                    973:
                    974: * I completely rewrote the documentation in the interface gc_c++.h
                    975: (later renamed gc_cpp.h).  I've tried to make it both clearer and more
                    976: precise.
                    977:
                    978: * The definition of accessibility now ignores pointers from an
                    979: finalizable object (an object with a clean-up function) to itself.
                    980: This allows objects with virtual base classes to be finalizable by the
                    981: collector.  Compilers typically implement virtual base classes using
                    982: pointers from an object to itself, which under the old definition of
                    983: accessibility prevented objects with virtual base classes from ever
                    984: being collected or finalized.
                    985:
                    986: * gc_cleanup now includes gc as a virtual base.  This was enabled by
                    987: the change in the definition of accessibility.
                    988:
                    989: * I added support for operator new[].  Since most (all?) compilers
                    990: don't yet support operator new[], it is conditionalized on
                    991: -DOPERATOR_NEW_ARRAY.  The code is untested, but its trivial and looks
                    992: correct.
                    993:
                    994: * The test program test_gc_c++ (later renamed test_cpp.cc)
                    995: tries to test for the C++-specific functionality not tested by the
                    996: other programs.
                    997: - Added <unistd.h> include to misc.c.  (Needed for ppcr.)
                    998: - Added PowerMac port. (Thanks to Patrick Beard again.)
                    999: - Fixed "srcdir"-related Makefile problems.  Changed things so
                   1000:   that all externally visible include files always appear in the
                   1001:   include subdirectory of the source.  Made gc.h directly
                   1002:   includable from C++ code.  (These were at Per
                   1003:   Bothner's suggestion.)
                   1004: - Changed Intel code to also mark from ebp (Kevin Warne's
                   1005:   suggestion).
                   1006: - Renamed C++ related files so they could live in a FAT
                   1007:   file system. (Charles Fiterman's suggestion.)
                   1008: - Changed Windows NT Makefile to include C++ support in
                   1009:   gc.lib.  Added C++ test as Makefile target.
                   1010:
                   1011: Since version 4.3:
                   1012:  - ASM_CLEAR_CODE was erroneously defined for HP
                   1013:    PA machines, resulting in a compile error.
                   1014:  - Fixed OS/2 Makefile to create a library.  (Thanks to
                   1015:    Mark Boulter (mboulter@vnet.ibm.com)).
                   1016:  - Gc_cleanup objects didn't work if they were created on
                   1017:    the stack.  Fixed.
                   1018:  - One copy of Gc_cpp.h in the distribution was out of
                   1019:    synch, and failed to document some known compiler
                   1020:    problems with explicit destructor invocation.  Partially
                   1021:    fixed.  There are probably other compilers on which
                   1022:    gc_cleanup is miscompiled.
                   1023:  - Fixed Makefile to pass C compiler flags to C++ compiler.
                   1024:  - Added Mac fixes.
                   1025:  - Fixed os_dep.c to work around what appears to be
                   1026:    a new and different VirtualQuery bug under newer
                   1027:    versions of win32S.
                   1028:  - GC_non_gc_bytes was not correctly maintained by
                   1029:    GC_free.  Fixed.  Thanks to James Clark (jjc@jclark.com).
                   1030:  - Added GC_set_max_heap_size.
                   1031:  - Changed allocation code to ignore blacklisting if it is preventing
                   1032:    use of a very large block of memory.  This has the advantage
                   1033:    that naive code allocating very large objects is much more
                   1034:    likely to work.  The downside is you might no
                   1035:    longer find out that such code should really use
                   1036:    GC_malloc_ignore_off_page.
                   1037:  - Changed GC_printf under win32 to close and reopen the file
                   1038:    between calls.  FAT file systems otherwise make the log file
                   1039:    useless for debugging.
                   1040:  - Added GC_try_to_collect and GC_get_bytes_since_gc.  These
                   1041:    allow starting an abortable collection during idle times.
                   1042:    This facility does not require special OS support.  (Thanks to
                   1043:    Michael Spertus of Geodesic Systems for suggesting this.  It was
                   1044:    actually an easy addition.  Kumar Srikantan previously added a similar
                   1045:    facility to a now ancient version of the collector.  At the time
                   1046:    this was much harder, and the result was less convincing.)
                   1047:  - Added some support for the Borland development environment.  (Thanks
                   1048:    to John Ellis and Michael Spertus.)
                   1049:  - Removed a misfeature from checksums.c that caused unexpected
                   1050:    heap growth.  (Thanks to Scott Schwartz.)
                   1051:  - Changed finalize.c to call WARN if it encounters a finalization cycle.
                   1052:    WARN is defined in gc_priv.h to write a message, usually to stdout.
                   1053:    In many environments, this may be inappropriate.
                   1054:  - Renamed NO_PARAMS in gc.h to GC_NO_PARAMS, thus adhering to my own
                   1055:    naming convention.
                   1056:  - Added GC_set_warn_proc to intercept warnings.
                   1057:  - Fixed Amiga port. (Thanks to Michel Schinz (schinz@alphanet.ch).)
                   1058:  - Fixed a bug in mark.c that could result in an access to unmapped
                   1059:    memory from GC_mark_from_mark_stack on machines with unaligned
                   1060:    pointers.
                   1061:  - Fixed a win32 specific performance bug that could result in scanning of
                   1062:    objects allocated with the system malloc.
                   1063:  - Added REDIRECT_MALLOC.
                   1064:
                   1065: Since version 4.4:
                   1066:  - Fixed many minor and one major README bugs. (Thanks to Franklin Chen
                   1067:    (chen@adi.com) for pointing out many of them.)
                   1068:  - Fixed ALPHA/OSF/1 dynamic library support. (Thanks to Jonathan Bachrach
                   1069:    (jonathan@harlequin.com)).
                   1070:  - Added incremental GC support (MPROTECT_VDB) for Linux (with some
                   1071:    help from Bruno Haible).
                   1072:  - Altered SPARC recognition tests in gc.h and config.h (mostly as
                   1073:    suggested by Fergus Henderson).
                   1074:  - Added basic incremental GC support for win32, as implemented by
                   1075:    Windows NT and Windows 95.  GC_enable_incremental is a noop
                   1076:    under win32s, which doesn't implement enough of the VM interface.
                   1077:  - Added -DLARGE_CONFIG.
                   1078:  - Fixed GC_..._ignore_off_page to also function without
                   1079:    -DALL_INTERIOR_POINTERS.
                   1080:  - (Hopefully) fixed RS/6000 port.  (Only the test was broken.)
                   1081:  - Fixed a performance bug in the nonincremental collector running
                   1082:    on machines supporting incremental collection with MPROTECT_VDB
                   1083:    (e.g. SunOS 4, DEC AXP).  This turned into a correctness bug under
                   1084:    win32s with win32 incremental collection.  (Not all memory protection
                   1085:    was disabled.)
                   1086:  - Fixed some ppcr related bit rot.
                   1087:  - Caused dynamic libraries to be unregistered before reregistering.
                   1088:    The old way turned out to be a performance bug on some machines.
                   1089:  - GC_root_size was not properly maintained under MSWIN32.
                   1090:  - Added -DNO_DEBUGGING and GC_dump.
                   1091:  - Fixed a couple of bugs arising with SOLARIS_THREADS +
                   1092:    REDIRECT_MALLOC.
                   1093:  - Added NetBSD/M68K port.  (Thanks to Peter Seebach
                   1094:    <seebs@taniemarie.solon.com>.)
                   1095:  - Fixed a serious realloc bug.  For certain object sizes, the collector
                   1096:    wouldn't scan the expanded part of the object.  (Thanks to Clay Spence
                   1097:    (cds@peanut.sarnoff.com) for noticing the problem, and helping me to
                   1098:    track it down.)
                   1099:
                   1100: Since version 4.5:
                   1101:  - Added Linux ELF support.  (Thanks to Arrigo Triulzi <arrigo@ic.ac.uk>.)
                   1102:  - GC_base crashed if it was called before any other GC_ routines.
                   1103:    This could happen if a gc_cleanup object was allocated outside the heap
                   1104:    before any heap allocation.
                   1105:  - The heap expansion heuristic was not stable if all objects had finalization
                   1106:    enabled.  Fixed finalize.c to count memory in finalization queue and
                   1107:    avoid explicit deallocation.  Changed alloc.c to also consider this count.
                   1108:    (This is still not recommended.  It's expensive if nothing else.)  Thanks
                   1109:    to John Ellis for pointing this out.
                   1110:  - GC_malloc_uncollectable(0) was broken.  Thanks to Phong Vo for pointing
                   1111:    this out.
                   1112:  - The collector didn't compile under Linux 1.3.X.  (Thanks to Fred Gilham for
                   1113:    pointing this out.)  The current workaround is ugly, but expected to be
                   1114:    temporary.
                   1115:  - Fixed a formatting problem for SPARC stack traces.
                   1116:  - Fixed some '=='s in os_dep.c that should have been assignments.
                   1117:    Fortunately these were in code that should never be executed anyway.
                   1118:    (Thanks to Fergus Henderson.)
                   1119:  - Fixed the heap block allocator to only drop blacklisted blocks in small
                   1120:    chunks.  Made BL_LIMIT self adjusting.  (Both of these were in response
                   1121:    to heap growth observed by Paul Graham.)
                   1122:  - Fixed the Metrowerks/68K Mac code to also mark from a6.  (Thanks
                   1123:    to Patrick Beard.)
                   1124:  - Significantly updated README.debugging.
                   1125:  - Fixed some problems with longjmps out of signal handlers, especially under
                   1126:    Solaris.  Added a workaround for the fact that siglongjmp doesn't appear to
                   1127:    do the right thing with -lthread under Solaris.
                   1128:  - Added MSDOS/djgpp port.  (Thanks to Mitch Harris  (maharri@uiuc.edu).)
                   1129:  - Added "make reserved_namespace" and "make user_namespace".  The
                   1130:    first renames ALL "GC_xxx" identifiers as "_GC_xxx".  The second is the
                   1131:    inverse transformation.  Note that doing this is guaranteed to break all
                   1132:    clients written for the other names.
                   1133:  - descriptor field for kind NORMAL in GC_obj_kinds with ADD_BYTE_AT_END
                   1134:    defined should be -ALIGNMENT not WORDS_TO_BYTES(-1).  This is
                   1135:    a serious bug on machines with pointer alignment of less than a word.
                   1136:  - GC_ignore_self_finalize_mark_proc didn't handle pointers to very near the
                   1137:    end of the object correctly.  Caused failures of the C++ test on a DEC Alpha
                   1138:    with g++.
                   1139:  - gc_inl.h still had problems.  Partially fixed.  Added warnings at the
                   1140:    beginning to hopefully specify the remaining dangers.
                   1141:  - Added DATAEND definition to config.h.
                   1142:  - Fixed some of the .h file organization.  Fixed "make floppy".
                   1143:
                   1144: Since version 4.6:
                   1145:  - Fixed some compilation problems with -DCHECKSUMS (thanks to Ian Searle)
                   1146:  - Updated some Mac specific files to synchronize with Patrick Beard.
                   1147:  - Fixed a serious bug for machines with non-word-aligned pointers.
                   1148:    (Thanks to Patrick Beard for pointing out the problem.  The collector
                   1149:    should fail almost any conceivable test immediately on such machines.)
                   1150:
                   1151: Since version 4.7:
                   1152:  - Changed a "comment" in a MacOS specific part of mach-dep.c that caused
                   1153:    gcc to fail on other platforms.
                   1154:
                   1155: Since version 4.8
                   1156:  - More README.debugging fixes.
                   1157:  - Objects ready for finalization, but not finalized in the same GC
                   1158:    cycle, could be prematurely collected.  This occasionally happened
                   1159:    in test_cpp.
                   1160:  - Too little memory was obtained from the system for very large
                   1161:    objects.  That could cause a heap explosion if these objects were
                   1162:    not contiguous (e.g. under PCR), and too much of them was blacklisted.
                   1163:  - Due to an improper initialization, the collector was too hesitant to
                   1164:    allocate blacklisted objects immediately after system startup.
                   1165:  - Moved GC_arrays from the data into the bss segment by not explicitly
                   1166:    initializing it to zero.  This significantly
                   1167:    reduces the size of executables, and probably avoids some disk accesses
                   1168:    on program startup.  It's conceivable that it might break a port that I
                   1169:    didn't test.
                   1170:  - Fixed EMX_MAKEFILE to reflect the gc_c++.h to gc_cpp.h renaming which
                   1171:    occurred a while ago.
                   1172:
                   1173: Since 4.9:
                   1174:  - Fixed a typo around a call to GC_collect_or_expand in alloc.c.  It broke
                   1175:    handling of out of memory.  (Thanks to Patrick Beard for noticing.)
                   1176:
                   1177: Since 4.10:
                   1178:  - Rationalized (hopefully) GC_try_to_collect in an incremental collection
                   1179:    environment.  It appeared to not handle a call while a collection was in
                   1180:    progress, and was otherwise too conservative.
                   1181:  - Merged GC_reclaim_or_delete_all into GC_reclaim_all to get rid of some
                   1182:    code.
                   1183:  - Added Patrick Beard's Mac fixes, with substantial completely untested
                   1184:    modifications.
                   1185:  - Fixed the MPROTECT_VDB code to deal with large pages and imprecise
                   1186:    fault addresses (as on an UltraSPARC running Solaris 2.5).  Note that this
                   1187:    was not a problem in the default configuration, which uses PROC_VDB.
                   1188:  - The DEC Alpha assembly code needed to restore $gp between calls.
                   1189:    Thanks to Fergus Henderson for tracking this down and supplying a
                   1190:    patch.
                   1191:  - The write command for "de" was completely broken for large files.
                   1192:    I used the easiest portable fix, which involved changing the semantics
                   1193:    so that f.new is written instead of overwriting f.  That's safer anyway.
                   1194:  - Added README.solaris2 with a discussion of the possible problems of
                   1195:    mixing the collector's sbrk allocation with malloc/realloc.
                   1196:  - Changed the data segment starting address for SGI machines.  The
                   1197:    old code failed under IRIX6.
                   1198:  - Required double word alignment for MIPS.
                   1199:  - Various minor fixes to remove warnings.
                   1200:  - Attempted to fix some Solaris threads problems reported by Zhiying Chen.
                   1201:    In particular, the collector could try to fork a thread with the
                   1202:    world stopped as part of GC_thr_init.  It also failed to deal with
                   1203:    the case in which the original thread terminated before the whole
                   1204:    process did.
                   1205:  - Added -DNO_EXECUTE_PERMISSION.  This has a major performance impact
                   1206:    on the incremental collector under Irix, and perhaps under other
                   1207:    operating systems.
                   1208:  - Added some code to support allocating the heap with mmap.  This may
                   1209:    be preferable under some circumstances.
                   1210:  - Integrated dynamic library support for HP.
                   1211:    (Thanks to Knut Tvedten <knuttv@ifi.uio.no>.)
                   1212:  - Integrated James Clark's win32 threads support, and made a number
                   1213:    of changes to it, many of which were suggested by Pontus Rydin.
                   1214:    This is still not 100% solid.
                   1215:  - Integrated Alistair Crooks' support for UTS4 running on an Amdahl
                   1216:    370-class machine.
                   1217:  - Fixed a serious bug in explicitly typed allocation.  Objects requiring
                   1218:    large descriptors where handled in a way that usually resulted in
                   1219:    a segmentation fault in the marker.  (Thanks to Jeremy Fitzhardinge
                   1220:    for helping to track this down.)
                   1221:  - Added partial support for GNU win32 development.  (Thanks to Fergus
                   1222:    Henderson.)
                   1223:  - Added optional support for Java-style finalization semantics.  (Thanks
                   1224:    to Patrick Bridges.)  This is recommended only for Java implementations.
                   1225:  - GC_malloc_uncollectable faulted instead of returning 0 when out of
                   1226:    memory.  (Thanks to dan@math.uiuc.edu for noticing.)
                   1227:  - Calls to GC_base before the collector was initialized failed on a
                   1228:    DEC Alpha.  (Thanks to Matthew Flatt.)
                   1229:  - Added base pointer checking to GC_REGISTER_FINALIZER in debugging
                   1230:    mode, at the suggestion of Jeremy Fitzhardinge.
                   1231:  - GC_debug_realloc failed for uncollectable objects.  (Thanks to
                   1232:    Jeremy Fitzhardinge.)
                   1233:  - Explicitly typed allocation could crash if it ran out of memory.
                   1234:    (Thanks to Jeremy Fitzhardinge.)
                   1235:  - Added minimal support for a DEC Alpha running Linux.
                   1236:  - Fixed a problem with allocation of objects whose size overflowed
                   1237:    ptrdiff_t.  (This now fails unconditionally, as it should.)
                   1238:  - Added the beginning of Irix pthread support.
                   1239:  - Integrated Xiaokun Zhu's fixes for djgpp 2.01.
                   1240:  - Added SGI-style STL allocator support (gc_alloc.h).
                   1241:  - Fixed a serious bug in README.solaris2.  Multithreaded programs must include
                   1242:    gc.h with SOLARIS_THREADS defined.
                   1243:  - Changed GC_free so it actually deallocates uncollectable objects.
                   1244:    (Thanks to Peter Chubb for pointing out the problem.)
                   1245:  - Added Linux ELF support for dynamic libararies.  (Thanks again to
                   1246:    Patrick Bridges.)
                   1247:  - Changed the Borland cc configuration so that the assembler is not
                   1248:    required.
                   1249:  - Fixed a bug in the C++ test that caused it to fail in 64-bit
                   1250:    environments.
                   1251:
                   1252: Since 4.11:
                   1253:  - Fixed ElfW definition in dyn_load.c. (Thanks to Fergus Henderson.)
                   1254:    This prevented the dynamic library support from compiling on some
                   1255:    older ELF Linux systems.
                   1256:  - Fixed UTS4 port (which I apparently mangled during the integration)
                   1257:    (Thanks to again to Alistair Crooks.)
                   1258:  - "Make C++" failed on Suns with SC4.0, due to a problem with "bool".
                   1259:    Fixed in gc_priv.h.
                   1260:  - Added more pieces for GNU win32.  (Thanks to Timothy N. Newsham.)
                   1261:    The current state of things should suffice for at least some
                   1262:    applications.
                   1263:  - Changed the out of memory retry count handling as suggested by
                   1264:    Kenjiro Taura.  (This matters only if GC_max_retries > 0, which
                   1265:    is no longer the default.)
                   1266:  - If a /proc read failed repeatedly, GC_written_pages was not updated
                   1267:    correctly.  (Thanks to Peter Chubb for diagnosing this.)
                   1268:  - Under unlikely circumstances, the allocator could infinite loop in
                   1269:    an out of memory situation.  (Thanks again to Kenjiro Taura for
                   1270:    identifying the problem and supplying a fix.)
                   1271:  - Fixed a syntactic error in the DJGPP code.  (Thanks to Fergus
                   1272:    Henderson for finding this by inspection.)  Also fixed a test program
                   1273:    problem with DJGPP (Thanks to Peter Monks.)
                   1274:  - Atomic uncollectable objects were not treated correctly by the
                   1275:    incremental collector.  This resulted in weird log statistics and
                   1276:    occasional performance problems.  (Thanks to Peter Chubb for pointing
                   1277:    this out.)
                   1278:  - Fixed some problems resulting from compilers that dont define
                   1279:    __STDC__.  In this case void * and char * were used inconsistently
                   1280:    in some cases.  (Void * should not have been used at all.  If
                   1281:    you have an ANSI superset compiler that does not define __STDC__,
                   1282:    please compile with -D__STDC__=0. Thanks to Manuel Serrano and others
                   1283:    for pointing out the problem.)
                   1284:  - Fixed a compilation problem on Irix with -n32 and -DIRIX_THREADS.
                   1285:    Also fixed some other IRIX_THREADS problems which may or may not have
                   1286:    had observable symptoms.
                   1287:  - Fixed an HP PA compilation problem in dyn_load.c.  (Thanks to
                   1288:    Philippe Queinnec.)
                   1289:  - SEGV fault handlers sometimes did not get reset correctly.  (Thanks
                   1290:    to David Pickens.)
                   1291:  - Added a fix for SOLARIS_THREADS on Intel.  (Thanks again to David
                   1292:    Pickens.)  This probably needs more work to become functional.
                   1293:  - Fixed struct sigcontext_struct in os_dep.c for compilation under
                   1294:    Linux 2.1.X.        (Thanks to Fergus Henderson.)
                   1295:  - Changed the DJGPP STACKBOTTOM and DATASTART values to those suggested
                   1296:    by Kristian Kristensen.  These may still not be right, but it is
                   1297:    it is likely to work more often than what was there before.  They may
                   1298:    even be exactly right.
                   1299:  - Added a #include <string.h> to test_cpp.cc.  This appears to help
                   1300:    with HP/UX and gcc.  (Thanks to assar@sics.se.)
                   1301:  - Version 4.11 failed to run in incremental mode on recent 64-bit Irix
                   1302:    kernels.  This was a problem related to page unaligned heap segments.
                   1303:    Changed the code to page align heap sections on all platforms.
                   1304:    (I had mistakenly identified this as a kernel problem earlier.
                   1305:    It was not.)
                   1306:  - Version 4.11 did not make allocated storage executable, except on
                   1307:    one or two platforms, due to a bug in a #if test.  (Thanks to Dave
                   1308:    Grove for pointing this out.)
                   1309:  - Added sparc_sunos4_mach_dep.s to support Sun's compilers under SunOS4.
                   1310:  - Added GC_exclude_static_roots.
                   1311:  - Fixed the object size mapping algorithm.  This shouldn't matter,
                   1312:    but the old code was ugly.
                   1313:  - Heap checking code could die if one of the allocated objects was
                   1314:    larger than its base address.  (Unsigned underflow problem.  Thanks
                   1315:    to Clay Spence for isolating the problem.)
                   1316:  - Added RS6000 (AIX) dynamic library support and fixed STACK_BOTTOM.
                   1317:    (Thanks to Fred Stearns.)
                   1318:  - Added Fergus Henderson's patches for improved robustness with large
                   1319:    heaps and lots of blacklisting.
                   1320:  - Added Peter Chubb's changes to support Solaris Pthreads, to support
                   1321:    MMAP allocation in Solaris, to allow Solaris to find dynamic libraries
                   1322:    through /proc, to add malloc_typed_ignore_off_page, and a few other
                   1323:    minor features and bug fixes.
                   1324:  - The Solaris 2 port should not use sbrk.  I received confirmation from
                   1325:    Sun that the use of sbrk and malloc in the same program is not
                   1326:    supported.  The collector now defines USE_MMAP by default on Solaris.
                   1327:  - Replaced the djgpp makefile with Gary Leavens' version.
                   1328:  - Fixed MSWIN32 detection test.
                   1329:  - Added Fergus Henderson's patches to allow putting the collector into
                   1330:    a DLL under GNU win32.
                   1331:  - Added Ivan V. Demakov's port to Watcom C on X86.
                   1332:  - Added Ian Piumarta's Linux/PowerPC port.
                   1333:  - On Brian Burton's suggestion added PointerFreeGC to the placement
                   1334:    options in gc_cpp.h.  This is of course unsafe, and may be controversial.
                   1335:    On the other hand, it seems to be needed often enough that it's worth
                   1336:    adding as a standard facility.
                   1337:
                   1338: Since 4.12:
                   1339:  - Fixed a crucial bug in the Watcom port.  There was a redundant decl
                   1340:    of GC_push_one in gc_priv.h.
                   1341:  - Added FINALIZE_ON_DEMAND.
                   1342:  - Fixed some pre-ANSI cc problems in test.c.
                   1343:  - Removed getpagesize() use for Solaris.  It seems to be missing in one
                   1344:    or two versions.
                   1345:  - Fixed bool handling for SPARCCompiler version 4.2.
                   1346:  - Fixed some files in include that had gotten unlinked from the main
                   1347:    copy.
                   1348:  - Some RS/6000 fixes (missing casts).  Thanks to Toralf Foerster.
                   1349:  - Fixed several problems in GC_debug_realloc, affecting mostly the
                   1350:    FIND_LEAK case.
                   1351:  - GC_exclude_static_roots contained a buggy unsigned comparison to
                   1352:    terminate a loop.  (Thanks to Wilson Ho.)
                   1353:  - CORD_str failed if the substring occurred at the last possible position.
                   1354:    (Only affects cord users.)
                   1355:  - Fixed Linux code to deal with RedHat 5.0 and integrated Peter Bigot's
                   1356:    os_dep.c code for dealing with various Linux versions.
                   1357:  - Added workaround for Irix pthreads sigaction bug and possible signal
                   1358:    misdirection problems.
                   1359: Since alpha1:
                   1360:  - Changed RS6000 STACKBOTTOM.
                   1361:  - Integrated Patrick Beard's Mac changes.
                   1362:  - Alpha1 didn't compile on Irix m.n, m < 6.
                   1363:  - Replaced Makefile.dj with a new one from Gary Leavens.
                   1364:  - Added Andrew Stitcher's changes to support SCO OpenServer.
                   1365:  - Added PRINT_BLACK_LIST, to allow debugging of high densities of false
                   1366:    pointers.
                   1367:  - Added code to debug allocator to keep track of return address
                   1368:    in GC_malloc caller, thus giving a bit more context.
                   1369:  - Changed default behavior of large block allocator to more
                   1370:    aggressively avoid fragmentation.  This is likely to slow down the
                   1371:    collector when it succeeds at reducing space cost.
                   1372:  - Integrated Fergus Henderson's CYGWIN32 changes.  They are untested,
                   1373:    but needed for newer versions.
                   1374:  - USE_MMAP had some serious bugs.  This caused the collector to fail
                   1375:    consistently on Solaris with -DSMALL_CONFIG.
                   1376:  - Added Linux threads support, thanks largely to Fergus Henderson.
                   1377: Since alpha2:
                   1378:  - Fixed more Linux threads problems.
                   1379:  - Changed default GC_free_space_divisor to 3 with new large block allocation.
                   1380:    (Thanks to Matthew Flatt for some measurements that suggest the old
                   1381:    value sometimes favors space too much over time.)
                   1382:  - More CYGWIN32 fixes.
                   1383:  - Integrated Tyson-Dowd's Linux-M68K port.
                   1384:  - Minor HP PA and DEC UNIX fixes from Fergus Henderson.
                   1385:  - Integrated Christoffe Raffali's Linux-SPARC changes.
                   1386:  - Allowed for one more GC fixup iteration after a full GC in incremental
                   1387:    mode.  Some quick measurements suggested that this significantly
                   1388:    reduces pause times even with smaller GC_RATE values.
                   1389:  - Moved some more GC data structures into GC_arrays.  This decreases
                   1390:    pause times and GC overhead, but makes debugging slightly less convenient.
                   1391:  - Fixed namespace pollution problem ("excl_table").
                   1392:  - Made GC_incremental a constant for -DSMALL_CONFIG, hopefully shrinking
                   1393:    that slightly.
                   1394:  - Added some win32 threads fixes.
                   1395:  - Integrated Ivan Demakov and David Stes' Watcom fixes.
                   1396:  - Various other minor fixes contributed by many people.
                   1397:  - Renamed config.h to gcconfig.h, since config.h tends to be used for
                   1398:    many other things.
                   1399:  - Integrated Matthew Flatt's support for 68K MacOS "far globals".
                   1400:  - Fixed up some of the dynamic library Makefile targets for consistency
                   1401:    across platforms.
                   1402:  - Fixed a USE_MMAP typo that caused out-of-memory handling to fail
                   1403:    on Solaris.
                   1404:  - Added code to test.c to test thread creation a bit more.
                   1405:  - Integrated GC_win32_free_heap, as suggested by Ivan Demakov.
                   1406:  - Fixed Solaris 2.7 stack base finding problem.  (This may actually
                   1407:    have been done in an earlier alpha release.)
                   1408: Since alpha3:
                   1409:  - Fixed MSWIN32 recognition test, which interfered with cygwin.
                   1410:  - Removed unnecessary gc_watcom.asm from distribution.  Removed
                   1411:    some obsolete README.win32 text.
                   1412:  - Added Alpha Linux incremental GC support.  (Thanks to Philipp Tomsich
                   1413:    for code for retrieving the fault address in a signal handler.)
                   1414:    Changed Linux signal handler context argument to be a pointer.
                   1415:  - Took care of some new warnings generated by the 7.3 SGI compiler.
                   1416:  - Integrated Phillip Musumeci's FreeBSD/ELF fixes.
                   1417:  - -DIRIX_THREADS was broken with the -o32 ABI (typo in gc_priv.h>
                   1418:
                   1419: Since 4.13:
                   1420:  - Fixed GC_print_source_ptr to not use a prototype.
                   1421:  - generalized CYGWIN test.
                   1422:  - gc::new did the wrong thing with PointerFreeGC placement.
                   1423:    (Thanks to Rauli Ruohonen.)
                   1424:  - In the ALL_INTERIOR_POINTERS (default) case, some callee-save register
                   1425:    values could fail to be scanned if the register was saved and
                   1426:    reused in a GC frame.  This showed up in verbose mode with gctest
                   1427:    compiled with an unreleased SGI compiler.  I vaguely recall an old
                   1428:    bug report that may have been related.  The bug was probably quite old.
                   1429:    (The problem was that the stack scanning could be deferred until
                   1430:    after the relevant frame was overwritten, and the new save location
                   1431:    might be outside the scanned area.  Fixed by more eager stack scanning.)
                   1432:  - PRINT_BLACK_LIST had some problems.  A few source addresses were garbage.
                   1433:  - Replaced Makefile.dj and added -I flags to cord make targets.
                   1434:    (Thanks to Gary Leavens.)
                   1435:  - GC_try_to_collect was broken with the nonincremental collector.
                   1436:  - gc_cleanup destructors could pass the wrong address to
                   1437:    GC_register_finalizer_ignore_self in the presence of multiple
                   1438:    inheritance.  (Thanks to Darrell Schiebel.)
                   1439:  - Changed PowerPC Linux stack finding code.
                   1440:
                   1441: Since 4.14alpha1
                   1442:  - -DSMALL_CONFIG did not work reliably with large (> 4K) pages.
                   1443:    Recycling the mark stack during expansion could result in a size
                   1444:    zero heap segment, which confused things.  (This was probably also an
                   1445:    issue with the normal config and huge pages.)
                   1446:  - Did more work to make sure that callee-save registers were scanned
                   1447:    completely, even with the setjmp-based code.  Added USE_GENERIC_PUSH_REGS
                   1448:    macro to facilitate testing on machines I have access to.
                   1449:  - Added code to explicitly push register contents for win32 threads.
                   1450:    This seems to be necessary.  (Thanks to Pierre de Rop.)
                   1451:
                   1452: Since 4.14alpha2
                   1453:  - changed STACKBOTTOM for DJGPP (Thanks to Salvador Eduardo Tropea).
1.1.1.2   maekawa  1454:
                   1455: Since 4.14
                   1456:  - Reworked large block allocator.  Now uses multiple doubly linked free
                   1457:    lists to approximate best fit.
                   1458:  - Changed heap expansion heuristic.  Entirely free blocks are no longer
                   1459:    counted towards the heap size.  This seems to have a major impact on
                   1460:    heap size stability; the old version could expand the heap way too
                   1461:    much in the presence of large block fragmentation.
                   1462:  - added -DGC_ASSERTIONS and some simple assertions inside the collector.
                   1463:    This is mainlyt for collector debugging.
                   1464:  - added -DUSE_MUNMAP to allow the heap to shrink.  Suupported on only
                   1465:    a few UNIX-like platforms for now.
                   1466:  - added GC_dump_regions() for debugging of fragmentation issues.
                   1467:  - Changed PowerPC pointer alignment under Linux to 4.  (This needs
                   1468:    checking by someone who has one.  The suggestions came to me via a
                   1469:    rather circuitous path.)
                   1470:  - Changed the Linux/Alpha port to walk the data segment backwards until
                   1471:    it encounters a SIGSEGV.  The old way to find the start of the data
                   1472:    segment broke with a recent release.
                   1473:  - cordxtra.c needed to call GC_REGISTER_FINALIZER instead of
                   1474:    GC_register_finalizer, so that it would continue to work with GC_DEBUG.
                   1475:  - allochblk sometimes cleared the wrong block for debugging purposes
                   1476:    when it dropped blacklisted blocks.  This could result in spurious
                   1477:    error reports with GC_DEBUG.
                   1478:  - added MACOS X Server support.  (Thanks to Andrew Stone.)
                   1479:  - Changed the Solaris threads code to ignore stack limits > 8 MB with
                   1480:    a warning.  Empirically, it is not safe to access arbitrary pages
                   1481:    in such large stacks.  And the dirty bit implementation does not
                   1482:    guarantee that none of them will be accessed.
                   1483:  - Integrated Martin Tauchmann's Amiga changes.
                   1484:  - Integrated James Dominy's OpenBSD/SPARC port.
                   1485:
                   1486: Since 5.0alpha1
                   1487:  - Fixed bugs introduced in alpha1 (OpenBSD & large block initialization).
                   1488:  - Added -DKEEP_BACK_PTRS and backptr.h interface.  (The implementation
                   1489:    idea came from Al Demers.)
                   1490:
                   1491: Since 5.0alpha2
                   1492:  - Added some highly incomplete code to support a copied young generation.
                   1493:    Comments on nursery.h are appreciated.
                   1494:  - Changed -DFIND_LEAK, -DJAVA_FINALIZATION, and -DFINALIZE_ON_DEMAND,
                   1495:    so the same effect could be obtained with a runtime switch.   This is
                   1496:    a step towards standardizing on a single dynamic GC library.
                   1497:  - Significantly changed the way leak detection is handled, as a consequence
                   1498:    of the above.
                   1499:
                   1500: Since 5.0 alpha3
                   1501:  - Added protection fault handling patch for Linux/M68K from Fergus
                   1502:    Henderson and Roman Hodek.
                   1503:  - Removed the tests for SGI_SOURCE in new_gc_alloc.h.  This was causing that
                   1504:    interface to fail on nonSGI platforms.
1.1.1.3 ! maekawa  1505:  - Changed the Linux stack finding code to use /proc, after changing it
1.1.1.2   maekawa  1506:    to use HEURISTIC1.  (Thanks to David Mossberger for pointing out the
                   1507:    /proc hook.)
                   1508:  - Added HP/UX incremental GC support and HP/UX 11 thread support.
1.1.1.3 ! maekawa  1509:    Thread support is currently still flakey.
1.1.1.2   maekawa  1510:  - Added basic Linux/IA64 support.
                   1511:  - Integrated Anthony Green's PicoJava support.
                   1512:  - Integrated Scott Ananian's StrongARM/NetBSD support.
                   1513:  - Fixed some fairly serious performance bugs in the incremental
                   1514:    collector.  These have probably been there essentially forever.
                   1515:    (Mark bits were sometimes set before scanning dirty pages.
                   1516:    The reclaim phase unnecessarily dirtied full small object pages.)
                   1517:  - Changed the reclaim phase to ignore nearly full pages to avoid
                   1518:    touching them.
                   1519:  - Limited GC_black_list_spacing to roughly the heap growth increment.
                   1520:  - Changed full collection triggering heuristic to decrease full GC
                   1521:    frequency by default, but to explicitly trigger full GCs during
                   1522:    heap growth.  This doesn't always improve things, but on average it's
                   1523:    probably a win.
                   1524:  - GC_debug_free(0, ...) failed.  Thanks to Fergus Henderson for the
                   1525:    bug report and fix.
1.1       maekawa  1526:
1.1.1.3 ! maekawa  1527: Since 5.0 alpha4
        !          1528:  - GC_malloc_explicitly_typed and friends sometimes failed to
        !          1529:    initialize first word.
        !          1530:  - Added allocation routines and support in the marker for mark descriptors
        !          1531:    in a type structure referenced by the first word of an object.  This was
        !          1532:    introduced to support gcj, but hopefully in a way that makes it
        !          1533:    generically useful.
        !          1534:  - Added GC_requested_heapsize, and inhibited collections in nonincremental
        !          1535:    mode if the actual used heap size is less than what was explicitly
        !          1536:    requested.
        !          1537:  - The Solaris pthreads version of GC_pthread_create didn't handle a NULL
        !          1538:    attribute pointer.  Solaris thread support used the wrong default thread
        !          1539:    stack size.  (Thanks to Melissa O'Neill for the patch.)
        !          1540:  - Changed PUSH_CONTENTS macro to no longer modify first parameter.
        !          1541:    This usually doesn't matter, but it was certainly an accident waiting
        !          1542:    to happen ...
        !          1543:  - Added GC_register_finalizer_no_order and friends to gc.h.  They're
        !          1544:    needed by Java implementations.
        !          1545:  - Integrated a fix for a win32 deadlock resulting from clock() calling
        !          1546:    malloc.  (Thanks to Chris Dodd.)
        !          1547:  - Integrated Hiroshi Kawashima's port to Linux/MIPS.  This was designed
        !          1548:    for a handheld platform, and may or may not be sufficient for other
        !          1549:    machines.
        !          1550:  - Fixed a va_arg problem with the %c specifier in cordprnt.c.  It appears
        !          1551:    that this was always broken, but recent versions of gcc are the first to
        !          1552:    report the (statically detectable) bug.
        !          1553:  - Added an attempt at a more general solution to dlopen races/deadlocks.
        !          1554:    GC_dlopen now temporarily disables collection.  Still not ideal, but ...
        !          1555:  - Added -DUSE_I686_PREFETCH, -DUSE_3DNOW_PREFETCH, and support for IA64
        !          1556:    prefetch instructions.  May improve performance measurably, but I'm not
        !          1557:    sure the code will run correctly on processors that don't support the
        !          1558:    instruction.  Won't build except with very recent gcc.
        !          1559:  - Added caching for header lookups in the marker.  This seems to result
        !          1560:    in a barely measurable performance gain.  Added support for interleaved
        !          1561:    lookups of two pointers, but unconfigured that since the performance
        !          1562:    gain is currently near zero, and it adds to code size.
        !          1563:  - Changed Linux DATA_START definition to check both data_start and
        !          1564:    __data_start, since nothing else seems to be portable.
        !          1565:  - Added -DUSE_LD_WRAP to optionally take advantage of the GNU ld function
        !          1566:    wrapping mechanism.  Probably currently useful only on Linux.
        !          1567:  - Moved some variables for the scratch allocator into GC_arrays, on
        !          1568:    Martin Hirzel's suggestion.
        !          1569:  - Fixed a win32 threads bug that caused the collector to not look for
        !          1570:    interior pointers from one of the thread stacks without
        !          1571:    ALL_INTERIOR_POINTERS.  (Thanks to Jeff Sturm.)
        !          1572:  - Added Mingw32 support.  (Thanks again to Jeff Sturm for the patch.)
        !          1573:  - Changed the alpha port to use the generic register scanning code instead
        !          1574:    of alpha_mach_dep.s.  Alpha_mach_dep.s doesn't look for pointers in fp
        !          1575:    registers, but gcc sometimes spills pointers there.  (Thanks to Manuel
        !          1576:    Serrano for helping me debug this by email.)  Changed the IA64 code to
        !          1577:    do something similar for similar reasons.
        !          1578:
        !          1579: Since 5.0alpha6:
        !          1580:  - -DREDIRECT_MALLOC was broken in alpha6. Fixed.
        !          1581:  - Cleaned up gc_ccp.h slightly, thus also causing the HP C++ compiler to
        !          1582:    accept it.
        !          1583:  - Removed accidental reference to dbg_mlc.c, which caused dbg_mlc.o to be
        !          1584:    linked into every executable.
        !          1585:  - Added PREFETCH to bitmap marker.  Changed it to use the header cache.
        !          1586:  - GC_push_marked sometimes pushed one object too many, resulting in a
        !          1587:    segmentation fault in GC_mark_from_mark_stack.  This was probably an old
        !          1588:    bug.  It finally showed up in gctest on win32.
        !          1589:  - Gc_priv.h erroneously #defined GC_incremental to be TRUE instead of FALSE
        !          1590:    when SMALL_CONFIG was defined.  This was no doubt a major performance bug for
        !          1591:    the default win32 configuration.
        !          1592:  - Removed -DSMALL_CONFIG from NT_MAKEFILE.  It seemed like an anchronism now
        !          1593:    that the average PC has 64MB or so.
        !          1594:  - Integrated Bryce McKinley's patches for linux threads and dynamic loading
        !          1595:    from the libgcj tree.  Turned on dynamic loading support for Linux/PPC.
        !          1596:  - Changed the stack finding code to use environ on HP/UX.  (Thanks
        !          1597:    to Gustavo Rodriguez-Rivera for the suggestion.)  This should probably
        !          1598:    be done on other platforms, too.  Since I can't test those, that'll
        !          1599:    wait until after 5.0.
        !          1600:
        !          1601: Since 5.0alpha7:
        !          1602:  - Fixed threadlibs.c for linux threads.  -DUSE_LD_WRAP was broken and
        !          1603:    -ldl was omitted.  Fixed Linux stack finding code to handle
        !          1604:    -DUSE_LD_WRAP correctly.
        !          1605:  - Added MSWIN32 exception handler around marker, so that the collector
        !          1606:    can recover from root segments that are unmapped during the collection.
        !          1607:    This caused occasional failures under Windows 98, and may also be
        !          1608:    an issue under Windows NT/2000.
        !          1609:
        !          1610: Since 5.0
        !          1611:  - Fixed a gc.h header bug which showed up under Irix.  (Thanks to
        !          1612:    Dan Sullivan.)
        !          1613:  - Fixed a typo in GC_double_descr in typd_mlc.c not getting traced correctly.
        !          1614:    This probably could result in objects described by array descriptors not
        !          1615:    getting traced correctly.  (Thanks to Ben Hutchings for pointing this out.)
        !          1616:  - The block nearly full tests in reclaim.c were not correct for 64 bit
        !          1617:    environments.  This could result in unnecessary heap growth under unlikely
        !          1618:    conditions.
        !          1619:  - Removed use of CLEAR_DOUBLE from generic reclaim code, since odd sizes
        !          1620:    could occur.
        !          1621:
        !          1622: Since 5.1
        !          1623:  - dyn_load.c declared GC_scratch_last_end_ptr as an extern even if it
        !          1624:    was defined as a macro.  This prevented the collector from building on
        !          1625:    Irix.
        !          1626:  - We quietly assumed that indirect mark descriptors were never 0.
        !          1627:    Our own typed allocation interface violated that.  This could result
        !          1628:    in segmentation faults in the marker with typed allocation.
        !          1629:  - Fixed a _DUSE_MUNMAP bug in the heap block allocation code.
        !          1630:    (Thanks to Ben Hutchings for the patch.)
        !          1631:  - Taught the collector about VC++ handling array operator new.
        !          1632:    (Thanks again to Ben Hutchings for the patch.)
        !          1633:  - The two copies of gc_hdrs.h had diverged.  Made one a link to the other
        !          1634:    again.
        !          1635:
        !          1636: Since 5.2
        !          1637:  - Fixed _end declaration for OSF1.
        !          1638:  - There were lots of spurious leak reports in leak detection mode, caused
        !          1639:    by the fact that some pages were not being swept, and hence unmarked
        !          1640:    objects weren't making it onto free lists.  (This bug dated back to 5.0.)
        !          1641:  - Fixed a typo in the liblinuxgc.so Makefile rule.
        !          1642:  - Added the GetExitCodeThread to Win32 GC_stop_world to (mostly) work
        !          1643:    around a Windows 95 GetOpenFileName problem.  (Thanks to Jacob Navia.)
        !          1644:
1.1       maekawa  1645: To do:
1.1.1.3 ! maekawa  1646:  - Integrate Linux/SPARC fixes.
1.1       maekawa  1647:  - Very large root set sizes (> 16 MB or so) could cause the collector
                   1648:    to abort with an unexpected mark stack overflow.  (Thanks again to
                   1649:    Peter Chubb.)  NOT YET FIXED.  Workaround is to increase the initial
                   1650:    size.
                   1651:  - The SGI version of the collector marks from mmapped pages, even
                   1652:    if they are not part of dynamic library static data areas.  This
                   1653:    causes performance problems with some SGI libraries that use mmap
                   1654:    as a bitmap allocator.  NOT YET FIXED.  It may be possible to turn
                   1655:    off DYNAMIC_LOADING in the collector as a workaround.  It may also
                   1656:    be possible to conditionally intercept mmap and use GC_exclude_static_roots.
                   1657:    The real fix is to walk rld data structures, which looks possible.
                   1658:  - Integrate MIT and DEC pthreads ports.
1.1.1.2   maekawa  1659:  - Incremental collector should handle large objects better.  Currently,
                   1660:    it looks like the whole object is treated as dirty if any part of it
                   1661:    is.
1.1.1.3 ! maekawa  1662:  - Cord/cordprnt.c doesn't build on a few platforms (notably PowerPC), since
        !          1663:    we make some unwarranted assumptions about how varargs are handled.  This
        !          1664:    currently makes the cord-aware versions of printf unusable on some platforms.
        !          1665:    Fixing this is unfortunately not trivial.

FreeBSD-CVSweb <freebsd-cvsweb@FreeBSD.org>