Just curious - does it make sense to have a solution specifically optimized for tree-like articulated systems? That's the case in most of my simulations (I'm not too interested in building a box pyramid and then running cars into it ;) ). I'd assume that when an iterative solver is working on a tree-structured system, you only need to carry the resultant forces >forward< through the tree.
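
To make the idea a bit more concrete, here's a rough sketch of the kind of single sweep I have in mind. It's only an illustration, not taken from any existing engine: the function name, the parent-array layout, and reading "forward" as "from the leaves toward the root" are all my own assumptions, and it only sums forces (no torques, no constraint solving).

```python
def accumulate_resultant_forces(parent, external_force):
    """One leaf-to-root sweep over a tree of bodies (hypothetical helper).

    parent[i] is the index of body i's parent, -1 for the root.
    Assumption: bodies are numbered so that parent[i] < i, which means
    visiting indices in reverse order handles every child before its parent.
    """
    resultant = [tuple(f) for f in external_force]
    # Walk from the last body down to body 1; body 0 is the root.
    for i in range(len(parent) - 1, 0, -1):
        p = parent[i]
        # Pass body i's accumulated force on to its parent.
        resultant[p] = tuple(a + b for a, b in zip(resultant[p], resultant[i]))
    return resultant


# Tiny tree: 0 is the root, 1 hangs off 0, and both 2 and 3 hang off 1.
parent = [-1, 0, 1, 1]
forces = [(0.0, -1.0), (0.0, -2.0), (1.0, 0.0), (0.0, -3.0)]
joint_loads = accumulate_resultant_forces(parent, forces)
```

Each body is touched exactly once, so the sweep is linear in the number of bodies. With loops in the constraint graph (like the box pyramid) that ordering doesn't exist, which is why I'd expect a tree-only solution to be simpler.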