Native Reflection in Rust

Native Reflection in Rust(jack.wrenn.fyi)

277 points by jswrenn 3 years ago | 65 comments

davidhyde 3 years ago |

Great writeup! The defmt logging crate uses a linker script to extract debug symbols so that you get nicely formatted stack traces on embedded systems. It works on linux, macos and windows. I wonder if the same technique can be applied to this project. It needs a runner though so may not be the right approach.

https://github.com/knurling-rs/defmt

olvy0 3 years ago |

I've used very similar method, at work, to provide C++ "reflection" between my own system and a system from another team.

Basically, the other system is a dynamic library which sends and receives C structures from my application. Those structures are then mapped into a buffer that is supposed to have the same size and there are pointers with metadata pointing into the buffer that are supposed to be exactly like the struct elements. Those structures can have arbitrary complexity, and are passed around through type erasure (essentially char*).

I wrote a "reflection" code for the other team, which runs when they register the struct instance to be sent, checks if there's a matching PDB [0] around, reads it, and outputs a json including the metadata needed, which can then be used to define the structures' metadata on our side correctly.

This is all in C/C++ since in some contexts we have soft real-time requirements, else I would have used any of the many RPC frameworks available.

This has been working for several years now.

This is not a generic solution but it's good enough for in-house communication between 2 systems that are maintained by different parts of the organization, where the API between them, that like I said is based on passing around char* buffers, has been more or less set in stone a long time ago. Conway's law [1] and all that. Sigh.

[0] We are a Windows shop although the same thing should work with DWARF info, same as the OP library works. In fact he says "It may never work on Windows, which does not use DWARF to encode debug info" but I can say that the same approach does work on Windows, for C++ at least. The PDB format might be a tad undocumented, but its documentation has been improved in the last decade or so since I started working on my library. Writing some small test programs is enough to understand how to access it, if all you need is meta info on C-style structures. Other stuff is more... challenging. But it wasn't necessary for my use-case.

[1] https://en.wikipedia.org/wiki/Conway%27s_law

jagged-chisel 3 years ago | |

Was the other team completely unwilling to provide a header?

olvy0 3 years ago | | |

Yes, they are willing, that wasn't the problem. The problem was that the consuming app on my side is historically metadata driven, and historically tries to avoid having to recompile when the interface changes. We do that by keeping the code generic and by reading the interface from a database. This leads to faster iterations. The problem rises when we have to interface with any other system which is not generic and has its interface defined in H files.

Yeah, I know, it's our problem, not theirs. It's something I cannot fix on my own without a huge effort. I've tried pushing for it for more than a decade, and at some point my wish was sort of abducted by my boss' boss as an excuse to create a DSL [0]. This did solve some huge problems but also created many others. It didn't solve that char* / h file problem since it doesn't really have an FFI.

[0] Domain specific language, custom-made for our own internal users. I've come to hate DSLs since I have to support that one, which never wanted.

jeroenhd 3 years ago |

Does using DWARF info imply that this will break when you strip the resulting executable? I often strip my Rust binaries because it practically halves the application size, which can become quite a lot in a language where you're statically linking everything.

Regardless, quite an ingenious use of standard ELF features, I didn't think this would be possible in Rust without adding some kind of VM around reflection code.

jswrenn 3 years ago | |

Yes, unfortunately that's a tradeoff here. Rust does support splitting debug info into other files, but Deflect doesn't support loading split debuginfo yet.

HideousKojima 3 years ago | |

C# has similar issues where they have to be conservative about what them trim from binaries for AoT in case it is used for reflection, so I imagine you'd run into the same issues for almost any compiled language you want to implement reflection for.

Animats 3 years ago |

"When you call .reflect on a dyn Reflect value, deflect figures out its concrete type in four steps:"

* invokes local_type_id to get the memory address of your value’s static implementation of local_type_id

* maps that memory address to an offset in your application’s binary

* searches your application’s debug info for the entry describing the function at that offset

* parses that debugging information entry (DIE) to determine the type of local_type_id’s &self parameter.

This is a rather strange thing to bolt onto a language. I could see this as an external tool. The use case seems to be programs which used "async" so much they can't figure out the resulting state machine. External debug tools to view and examine the async state machine might be helpful.

My experience with Rust has been that debugging of safe code is just not a problem. Print statements and logging are enough.

8jy89hui 3 years ago |

This is a beautiful (hacky) demo of something that I didn't think was possible in Rust (yet). I hope other applications don't accidentally start using it just to discover that it doesn't work in release mode.

Very impressive work!

jswrenn 3 years ago | |

Oh, I should add a note about that. Fortunately, it's quite easy to tell Rust to generate debuginfo even in release mode.

kp995 3 years ago |

Can’t we rely more on Rust’s Pattern Matching and it’s strong type system?

Reflection seems more helpful when the programming language is little unsounded.

jswrenn 3 years ago | |

Absolutely! That's the approach that frunk [0] takes. Frunk (and other reflection libraries like it) are suitable for most use cases, and make better use of Rust's affordances.

My crate is suitable for cases where you cannot know (or control) the set of types you might need to reflect on in advance. It's primary use-cases are related to debugging.

[0]: https://docs.rs/frunk

halfmatthalfcat 3 years ago | | |

Is Frunk Rust's Shapeless (from Scala)?

Thaxll 3 years ago |

Today I learn that Rust does not have reflection.

estebank 3 years ago | |

Reflection is usually not available in AoT compiled languages. The prevalent Rust coding styles rely heavily on monomorphic data types and functions, meaning there's nothing left to reflect at runtime. But if you want to deal with trait objects and need to access the underlying type, you need to use Any::downcast or rely on annotations on every type you want to reflect on. Or now, leverage DWARF info on Linux with deflect.

planede 3 years ago | | |

That's runtime reflection.

Compile time reflection AFAIK is available in D and Zig, and is planned for C++.

omginternets 3 years ago | | |

What are monomorphic data types? What should be my first read on the subject?

Tuna-Fish 3 years ago | |

Reflection is typically provided by a runtime, and languages that don't have runtimes usually don't have it. You shouldn't expect a low-level systems language to have reflection. There is no zero-cost way of implementing it.

Joker_vD 3 years ago | | |

Except Rust has runtime: [0]. And so, usually, does C (in hosted implementations).

[0] https://doc.rust-lang.org/reference/runtime.html

spacechild1 3 years ago | | |

This is of course only true for runtime reflection. And which language does not have a runtime?

snordgren 3 years ago | |

Rust has very little influence from reflection-heavy languages like Java and C#. On their list of influences (https://doc.rust-lang.org/reference/influences.html), Java is not even mentioned, and C# is only mentioned for its attributes. There is very little overlap between the design philosophies that influenced Rust and Java/C#.

Ruts does not support inheritance either. But I have never missed either feature in a Rust program.

nestorD 3 years ago | |

The usual argument is that between having macro and focusing on a strong type system, there are very few legitimate usecase for reflection left in Rust.

unconed 3 years ago |

My version of Greenspun's Tenth [1] is that any sufficiently complex static language contains an adhoc, informally specified, bug ridden and slow version of a dynamic "any" type.

Thx OP for providing an example.

[1] https://en.wikipedia.org/wiki/Greenspun's_tenth_rule

kibwen 3 years ago | |

Rust has a dynamic any type, `std::any::Any`.

unconed 3 years ago | | |

The entire purpose of OPs thing is to give you a semblance of workable reflection so you can actually operate on said type. It requires byzantine hacks to read debug info and doesn't work on macOS.

I don't think you understand how people in dynamic languages use any types at all.

bouk 3 years ago |

It would be really cool if it was possible to natively inspect the state of a Rust generator in a type-safe way

armchairhacker 3 years ago |

Does this still work if the application is complied in release mode or with optimizations?

Even if not, this is still very useful for debugging

jswrenn 3 years ago | |

It only works if DWARF is generated. By default, the `release` profile of Cargo sets `debug = false` [0]. But, it's quite easy to override this setting, and have a build that is both optimized and includes debuginfo.

[0]: https://doc.rust-lang.org/cargo/reference/profiles.html#rele...