1berry/ruby - ruby - Gitea : Git Mirror

Author	SHA1	Message	Date
Jean Boussier	f48e45d1e9	Move `object_id` in object fields. And get rid of the `obj_to_id_tbl` It's no longer needed, the `object_id` is now stored inline in the object alongside instance variables. We still need the inverse table in case `_id2ref` is invoked, but we lazily build it by walking the heap if that happens. The `object_id` concern is also no longer a GC implementation concern, but a generic implementation. Co-Authored-By: Matt Valentine-House <matt@eightbitraptor.com>	2025-05-08 07:58:05 +02:00
Jean Boussier	085cc6e434	Ractor: revert to moving object bytes, but size pool aware Using `rb_obj_clone` introduce other problems, such as `initialize_*` callbacks invocation in the context of the parent ractor. So we can revert back to copy the content of the object slots, but in a way that is aware of size pools.	2025-04-04 16:26:29 +02:00
Jean Boussier	0350290262	Ractor: Fix moving embedded objects [Bug #20271] [Bug #20267] [Bug #20255] `rb_obj_alloc(RBASIC_CLASS(obj))` will always allocate from the basic 40B pool, so if `obj` is larger than `40B`, we'll create a corrupted object when we later copy the shape_id. Instead we can use the same logic than ractor copy, which is to use `rb_obj_clone`, and later ask the GC to free the original object. We then must turn it into a `T_OBJECT`, because otherwise just changing its class to `RactorMoved` leaves a lot of ways to keep using the object, e.g.: ``` a = [1, 2, 3] Ractor.new{}.send(a, move: true) [].concat(a) # Should raise, but wasn't. ``` If it turns out that `rb_obj_clone` isn't performant enough for some uses, we can always have carefully crafted specialized paths for the types that would benefit from it.	2025-03-31 12:01:55 +02:00
Peter Zhu	7b6e07ea93	Add rb_gc_object_metadata API This function replaces the internal rb_obj_gc_flags API. rb_gc_object_metadata returns an array of name and value pairs, with the last element having 0 for the name.	2025-02-19 09:47:28 -05:00
Nobuyoshi Nakada	4a2702dafb	Remove stale declaration for modular GC	2025-01-11 01:22:26 +09:00
Peter Zhu	516a6cd1ad	Check whether object is valid in allocation_info_tracer_compact When reference updating ObjectSpace.trace_object_allocations, we need to check whether the object is valid or not because it does not mark the object so the object may be dead. This can cause a segmentation fault if the object is on a free heap page. For example, the following script crashes: require "objspace" objs = [] ObjectSpace.trace_object_allocations do 1_000_000.times do objs << Object.new end end objs = nil # Free pages that the objs were on GC.start # Run compaction and check that it doesn't crash GC.compact	2024-12-16 12:24:24 -05:00
Peter Zhu	c45503f957	Add rb_gc_impl_active_gc_name to gc/gc_impl.h	2024-12-06 10:22:03 -05:00
Peter Zhu	eedb30d385	Use rb_gc_enable/rb_gc_disable_no_rest instead of ruby_disable_gc We should use the rb_gc_enable/rb_gc_disable_no_rest APIs instead of directly setting the ruby_disable_gc variable.	2024-12-05 16:21:37 -05:00
Peter Zhu	ce1ad1b816	Standardize on the name "modular GC" We have name fragmentation for this feature, including "shared GC", "modular GC", and "external GC". This commit standardizes the feature name to "modular GC" and the implementation to "GC library".	2024-12-05 10:33:26 -05:00
Matt Valentine-House	551be8219e	Place all non-default GC API behind USE_SHARED_GC So that it doesn't get included in the generated binaries for builds that don't support loading shared GC modules Co-Authored-By: Peter Zhu <peter@peterzhu.ca>	2024-11-25 13:05:23 +00:00
Kunshan Wang	8ae7c22972	Annotate anonymous mmap Use PR_SET_VMA_ANON_NAME to set human-readable names for anonymous virtual memory areas mapped by `mmap()` when compiled and run on Linux 5.17 or higher. This makes it convenient for developers to debug mmap.	2024-11-21 13:48:05 -05:00
Matt Valentine-House	ee290c94a3	Include the currently active GC in RUBY_DESCRIPTION This will add +MOD_GC to the version string and Ruby description when Ruby is compiled with shared gc support. When shared GC support is compiled in and a GC module has been loaded using RUBY_GC_LIBRARY, the version string will include the name of the currently active GC as reported by the rb_gc_active_gc_name function in the form +MOD_GC[gc_name] [Feature #20794]	2024-11-14 10:46:36 +00:00
Matt Valentine-House	8e7df4b7c6	Rename size_pool -> heap Now that we've inlined the eden_heap into the size_pool, we should rename the size_pool to heap. So that Ruby contains multiple heaps, with different sized objects. The term heap as a collection of memory pages is more in memory management nomenclature, whereas size_pool was a name chosen out of necessity during the development of the Variable Width Allocation features of Ruby. The concept of size pools was introduced in order to facilitate different sized objects (other than the default 40 bytes). They wrapped the eden heap and the tomb heap, and some related state, and provided a reasonably simple way of duplicating all related concerns, to provide multiple pools that all shared the same structure but held different objects. Since then various changes have happend in Ruby's memory layout: * The concept of tomb heaps has been replaced by a global free pages list, with each page having it's slot size reconfigured at the point when it is resurrected * the eden heap has been inlined into the size pool itself, so that now the size pool directly controls the free_pages list, the sweeping page, the compaction cursor and the other state that was previously being managed by the eden heap. Now that there is no need for a heap wrapper, we should refer to the collection of pages containing Ruby objects as a heap again rather than a size pool	2024-10-03 21:20:09 +01:00
Peter Zhu	5f20957b85	Move ruby_load_external_gc_from_argv to gc.h	2024-07-03 09:03:40 -04:00
Peter Zhu	51bd816517	[Feature #20470 ] Split GC into gc_impl.c This commit splits gc.c into two files: - gc.c now only contains code not specific to Ruby GC. This includes code to mark objects (which the GC implementation may choose not to use) and wrappers for internal APIs that the implementation may need to use (e.g. locking the VM). - gc_impl.c now contains the implementation of Ruby's GC. This includes marking, sweeping, compaction, and statistics. Most importantly, gc_impl.c only uses public APIs in Ruby and a limited set of functions exposed in gc.c. This allows us to build gc_impl.c independently of Ruby and plug Ruby's GC into itself.	2024-07-03 09:03:40 -04:00
Peter Zhu	214811974b	Add ruby_mimcalloc Many places call ruby_mimmalloc then MEMZERO. This can be reduced by using ruby_mimcalloc instead.	2024-04-24 15:30:43 -04:00
Peter Zhu	81240493a3	Remove unused rb_size_pool_slot_size	2024-04-18 10:19:42 -04:00
Matt Valentine-House	065710c0f5	Initialize external GC Library Co-Authored-By: Peter Zhu <peter@peterzhu.ca>	2024-04-15 19:50:47 +01:00
Peter Zhu	f14e52c8c4	Fix setting GC stress at boot when objspace not available	2024-03-27 09:39:23 -04:00
eileencodes	e16086b7f2	Refactor init_copy gc attributes This PR moves `rb_copy_wb_protected_attribute` and `rb_gc_copy_finalizer` into a single function called `rb_gc_copy_attributes` to be called by `init_copy`. This reduces the surface area of the GC API. Co-authored-by: Peter Zhu <peter@peterzhu.ca>	2024-03-26 14:29:36 -04:00
Peter Zhu	9cf754b648	Fix --debug=gc_stress flag ruby_env_debug_option gets called after Init_gc_stress, so the --debug=gc_stress flag never works.	2024-03-25 13:07:39 -04:00
Peter Zhu	e07441f05f	Make rb_aligned_malloc private It is not used anywhere else.	2024-03-20 10:27:41 -04:00
Peter Zhu	3f5f04afa7	Remove duplicated function prototype rb_gc_disable_no_rest	2024-03-18 16:39:04 -04:00
Peter Zhu	4469729558	Remove rb_raw_obj_info_basic It's not used outside of gc.c.	2024-03-18 10:19:11 -04:00
Peter Zhu	ff51dc5654	[Feature #20265 ] Remove rb_newobj_of and RB_NEWOBJ_OF	2024-03-14 12:53:04 -04:00
Peter Zhu	6ad347a105	Don't directly read the SIZE_POOL_COUNT in shapes This removes the assumption about SIZE_POOL_COUNT for shapes.	2024-03-13 09:55:52 -04:00
Peter Zhu	2fc551e34e	Simplify NEWOBJ_OF macro	2024-03-13 09:05:52 -04:00
Peter Zhu	2c349cf4b6	Use NEWOBJ_OF_ec in NEWOBJ_OF_0	2024-03-11 19:30:09 -04:00
Jean Boussier	2d80b6093f	Retire RUBY_MARK_UNLESS_NULL Marking `Qnil` or `Qfalse` works fine, having an extra macro to avoid it isn't needed.	2024-03-08 14:13:14 +01:00
Peter Zhu	5481dbef07	Make rb_define_finalizer_no_check private	2024-02-28 13:45:19 -05:00
Peter Zhu	dcc976add9	Remove unused rb_gc_id2ref_obj_tbl	2024-02-28 12:21:38 -05:00
Peter Zhu	78ae6dbb11	Remove rb_objspace_marked_object_p rb_objspace_marked_object_p is no longer used in the objspace module, so we can remove it.	2024-02-26 17:05:34 -05:00
Peter Zhu	7538703d1b	Make rb_objspace_data_type_memsize private rb_objspace_data_type_memsize is not used in the objspace module, so we can make it private.	2024-02-26 17:05:34 -05:00
Peter Zhu	c9b6cd4223	Remove unused rb_objspace_each_objects_without_setup	2024-02-26 14:34:24 -05:00
Peter Zhu	e65315a725	Extract imemo functions from gc.c into imemo.c	2024-02-22 11:35:09 -05:00
Peter Zhu	1d3b306753	Move rb_class_allocate_instance from gc.c to object.c	2024-02-14 13:43:02 -05:00
KJ Tsanaktsidis	25f5b83689	Fix crash when printing RGENGC_DEBUG=5 output from GC I was trying to debug an (unrelated) issue in the GC, and wanted to turn on the trace-level GC output by compiling it with -DRGENGC_DEBUG=5. Unfortunately, this actually causes a crash in newobj_init() because the code there tries to log the obj_info() of the newly created object. However, the object is not actually sufficiently set up for some of the things that obj_info() tries to do: * The instance variable table for a class is not yet initialized, and when using variable-length RVALUES, said ivar table is embedded in as-yet unitialized memory after the struct RValue. Attempting to read this, as obj_info() does, causes a crash. * T_DATA variables need to dereference their ->type field to print out the underlying C type name, which is not set up until newobj_fill() is called. To fix this, create a new method `obj_info_basic`, which dumps out only the parts of the object that are valid before the object is fully initialized. [Fixes #18795]	2024-01-11 10:44:57 +11:00
Peter Zhu	269c705f93	Fix compaction for generic ivars When generic instance variable has a shape, it is marked movable. If it it transitions to too complex, it needs to update references otherwise it may have incorrect references.	2023-11-24 13:29:04 -05:00
Jean Boussier	e5364ea496	rb_shape_transition_shape_capa: use optimal sizes transitions Previously the growth was 3(embed), 6, 12, 24, ... With this change it's now 3(embed), 8, 16, 32, 64, ... by default. However, since power of two isn't the best size for all allocators, if `malloc_usable_size` is vailable, we use it to discover the best offset. On Linux/glibc 2.35 for instance, the growth will be 3(embed), 7, 15, 31 to avoid wasting 8B per object. Test program: ```c size_t test(size_t slots) { size_t allocated = slots * VALUE_SIZE; void test_ptr = malloc(allocated); size_t wasted = malloc_usable_size(test_ptr) - allocated; free(test_ptr); fprintf(stderr, "slots = %lu, wasted_bytes = %lu\n", slots, wasted); return wasted; } int main(int argc, char argv[]) { size_t best_padding = 0; size_t padding = 0; for (padding = 0; padding <= 2; padding++) { size_t wasted = test(8 - padding); if (wasted == 0) { best_padding = padding; break; } } size_t index = 0; fprintf(stderr, "=============== naive ================\n"); size_t list_size = 4; for (index = 0; index < 10; index++) { test(list_size); list_size = 2; } fprintf(stderr, "=============== auto-padded (-%lu) ================\n", best_padding); list_size = 4; for (index = 0; index < 10; index ++) { test(list_size - best_padding); list_size = 2; } fprintf(stderr, "\n\n"); return 0; } ``` ``` ===== glibc ====== slots = 8, wasted_bytes = 8 slots = 7, wasted_bytes = 0 =============== naive ================ slots = 4, wasted_bytes = 8 slots = 8, wasted_bytes = 8 slots = 16, wasted_bytes = 8 slots = 32, wasted_bytes = 8 slots = 64, wasted_bytes = 8 slots = 128, wasted_bytes = 8 slots = 256, wasted_bytes = 8 slots = 512, wasted_bytes = 8 slots = 1024, wasted_bytes = 8 slots = 2048, wasted_bytes = 8 =============== auto-padded (-1) ================ slots = 3, wasted_bytes = 0 slots = 7, wasted_bytes = 0 slots = 15, wasted_bytes = 0 slots = 31, wasted_bytes = 0 slots = 63, wasted_bytes = 0 slots = 127, wasted_bytes = 0 slots = 255, wasted_bytes = 0 slots = 511, wasted_bytes = 0 slots = 1023, wasted_bytes = 0 slots = 2047, wasted_bytes = 0 ``` ``` ========== jemalloc ======= slots = 8, wasted_bytes = 0 =============== naive ================ slots = 4, wasted_bytes = 0 slots = 8, wasted_bytes = 0 slots = 16, wasted_bytes = 0 slots = 32, wasted_bytes = 0 slots = 64, wasted_bytes = 0 slots = 128, wasted_bytes = 0 slots = 256, wasted_bytes = 0 slots = 512, wasted_bytes = 0 slots = 1024, wasted_bytes = 0 slots = 2048, wasted_bytes = 0 =============== auto-padded (-0) ================ slots = 4, wasted_bytes = 0 slots = 8, wasted_bytes = 0 slots = 16, wasted_bytes = 0 slots = 32, wasted_bytes = 0 slots = 64, wasted_bytes = 0 slots = 128, wasted_bytes = 0 slots = 256, wasted_bytes = 0 slots = 512, wasted_bytes = 0 slots = 1024, wasted_bytes = 0 slots = 2048, wasted_bytes = 0 ```	2023-10-23 09:33:15 +02:00
Peter Zhu	12102d101a	Fix crash in WeakMap during compaction WeakMap can crash during compaction because the st_insert could allocate memory.	2023-09-06 14:20:23 -04:00
Peter Zhu	9a8398a18f	Introduce rb_gc_remove_weak If we're during incremental marking, then Ruby code can execute that deallocates certain memory buffers that have been called with rb_gc_mark_weak, which can cause use-after-free bugs.	2023-09-05 14:32:15 -04:00
Matt Valentine-House	322548180d	Prevent rb_gc_mark_values from pinning objects This is an internal only function not exposed to the C extension API. It's only use so far is from rb_vm_mark, where it's used to mark the values in the vm->trap_list.cmd array. There shouldn't be any reason why these cannot move. This commit allows them to move by updating their references during the reference updating step of compaction. To do this we've introduced another internal function rb_gc_update_values as a partner to rb_gc_mark_values. This allows us to refactor rb_gc_mark_values to not pin	2023-08-31 19:31:18 +01:00
Peter Zhu	bfb395c620	Implement weak references in the GC [Feature #19783] This commit adds support for weak references in the GC through the function `rb_gc_mark_weak`. Unlike strong references, weak references does not mark the object, but rather lets the GC know that an object refers to another one. If the child object is freed, the pointer from the parent object is overwritten with `Qundef`. Co-Authored-By: Jean Boussier <byroot@ruby-lang.org>	2023-08-25 09:01:21 -04:00
Jean Boussier	fa30b99c34	Implement Process.warmup [Feature #18885] For now, the optimizations performed are: - Run a major GC - Compact the heap - Promote all surviving objects to oldgen Other optimizations may follow.	2023-07-17 11:20:15 +02:00
yui-knk	b481b673d7	[Feature #19719 ] Universal Parser Introduce Universal Parser mode for the parser. This commit includes these changes: * Introduce `UNIVERSAL_PARSER` macro. All of CRuby related functions are passed via `struct rb_parser_config_struct` when this macro is enabled. * Add CI task with 'cppflags=-DUNIVERSAL_PARSER' for ubuntu.	2023-06-12 18:23:48 +09:00
Peter Zhu	a0d1069e03	Make classes embedded on 32 bit Classes are now exactly 80 bytes when embedded, which perfectly fits the 3rd size pool on 32 bit systems.	2023-04-16 11:06:31 -04:00
Peter Zhu	b4571097df	Enable 5 size pools on 32 bit systems This commit will allow 32 bit systems to take advantage of VWA.	2023-04-11 11:25:12 -04:00
Matt Valentine-House	026321c5b9	[Feature #19474 ] Refactor NEWOBJ macros NEWOBJ_OF is now our canonical newobj macro. It takes an optional ec	2023-04-06 11:07:16 +01:00
Matt Valentine-House	879cda98a4	Remove dependancy of vm_core.h on shape.h so that now shape can happily include gc.h	2023-04-06 11:07:16 +01:00
Peter Zhu	1da2e7fca3	[Feature #19579 ] Remove !USE_RVARGC code (#7655 ) Remove !USE_RVARGC code [Feature #19579] The Variable Width Allocation feature was turned on by default in Ruby 3.2. Since then, we haven't received bug reports or backports to the non-Variable Width Allocation code paths, so we assume that nobody is using it. We also don't plan on maintaining the non-Variable Width Allocation code, so we are going to remove it.	2023-04-04 17:30:06 -04:00

1 2

96 Commits