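Parallel streams can help with CPU-bound work on large collections when each element can be processed independently and the per-element work is heavy enough to amortize the cost of splitting the source and merging the results. Pitfalls: by default they run on the shared `ForkJoinPool.commonPool()`, so they compete with every other parallel stream and common-pool task in the JVM; they can be slower than sequential streams for small or cheap workloads; they are a poor fit for blocking I/O, which ties up the pool's limited worker threads; and side effects or shared mutable state in the pipeline can cause race conditions.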
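
As a rough illustration of the points above, here is a minimal sketch. The class name `ParallelStreamSketch` and the `expensive` method are hypothetical, made up for this example; `expensive` stands in for whatever CPU-heavy, independent per-element work you actually have. It compares a sequential and a parallel reduction, and collects results with a collector instead of mutating a shared list.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public class ParallelStreamSketch {

    // Hypothetical stand-in for CPU-heavy work that depends only on its input.
    static long expensive(int n) {
        long acc = n;
        for (int i = 0; i < 1_000; i++) {
            acc = (acc * 31 + i) % 1_000_003;
        }
        return acc;
    }

    // Baseline: the same pipeline without parallel().
    static long sumSequential(int size) {
        return IntStream.range(0, size)
                .mapToLong(ParallelStreamSketch::expensive)
                .sum();
    }

    // Runs on ForkJoinPool.commonPool() by default. No shared state is
    // touched, and long addition is associative, so the result matches
    // the sequential version.
    static long sumParallel(int size) {
        return IntStream.range(0, size)
                .parallel()
                .mapToLong(ParallelStreamSketch::expensive)
                .sum();
    }

    // Safe way to gather results: let the collector do the merging.
    // Avoid forEach(sharedList::add) on a parallel stream; ArrayList is
    // not thread-safe, so that pattern is a race condition.
    static List<Integer> squaresParallel(int size) {
        return IntStream.range(0, size)
                .parallel()
                .map(i -> i * i)
                .boxed()
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        int size = 200_000; // arbitrary size chosen for the sketch

        long t0 = System.nanoTime();
        long seq = sumSequential(size);
        long t1 = System.nanoTime();
        long par = sumParallel(size);
        long t2 = System.nanoTime();

        System.out.println("results equal: " + (seq == par));
        System.out.println("sequential ms: " + (t1 - t0) / 1_000_000);
        System.out.println("parallel ms:   " + (t2 - t1) / 1_000_000);
        System.out.println(squaresParallel(5));
    }
}
```

The timings printed by `main` are only a crude indication, since this is not a proper benchmark (no warm-up, single run). Whether the parallel version wins depends on the core count, the input size, and how heavy the per-element work is, so measure with your own workload before switching.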