This document analyzes efficient stream processing on modern hardware. It identifies sources of inefficiency in current streaming systems and explores design changes to better utilize hardware, such as avoiding managed runtimes and queues. The authors show that an optimized scale-up solution could achieve up to two orders of magnitude performance improvement over state-of-the-art streaming systems through techniques like hardware-tailored compilation and late data merging.