Transfer the entry point and initial parameters in the callee_saved
struct rather than on the stack. This saves 48 bytes of stack per
thread and simplifies the logic.

Signed-off-by: Julius Barendt <julius.barendt@gaisler.com>