Parallel and Asynchronous Programming in Java 8
Parallel code, which is code that runs on more than one thread, was once the nightmare of many an experienced developer, but Java 8 brought a lot of changes that should make this performance-boosting trick a lot more manageable.
Parallel Streams
Before Java 8 there was a big difference between parallel (or concurrent) code and sequential code. It was also very hard to debug non-sequential code. Simply setting a breakpoint and going through the flow like you would normally do, would remove the parallel aspect, which is a problem if that is what is causing the bug.
Luckily, Java 8 gave us streams, the greatest thing for Java developers since the bean. If you don’t know what they are, the Stream API makes it possible to handle sequences of elements in a functional matter. (Check our comparison between streams and .NET’s LINQ here.) One of the advantages of streams is that the structure of the code stays the same: whether it’s sequential or concurrent, it stays just as readable.
To make your code run parallel, you simply use .parallelStream()
instead of .stream()
, (or stream
.parallel()
, if you are not the creator of the stream).
But just because it’s easy, doesn’t mean that parallel code is always the best choice. You should always consider whether it makes any sense to use concurrency for your piece of code. The most important factor in that decision will be the speed: only use concurrency if it makes your code faster than its sequential counterpart.
The speed question
Parallel code gets its speed benefit from using multiple threads instead of the single one that sequential code uses. Deciding how many threads to create can be a tricky question because more threads don’t always result in faster code: if you use too many threads the performance of your code might actually go down.
There are a couple of rules that will tell you what number of threads to choose. This depends mostly on the kind of operation that you want to perform and the number of available cores.
Computation intensive operations should use a number of threads lower than or equal to the number of cores, while IO intensive operations like copying files have no use for the CPU and can therefore use a higher number of threads. The code doesn’t know which case is applicable unless you tell it what to do. Otherwise, it will default to a number of threads equal to the number of cores.
There are two main cases when it can be useful to run your code parallel instead of sequential: time-consuming tasks and tasks run on big collections. Java 8 brought a new way of handling those big collections, namely with streams. Streams have built-in efficiency by laziness: they use lazy evaluation which saves resources by not doing more than necessary. This is not the same as parallelism, which doesn’t care about the resources as long as it goes faster. So for big collections, you probably don’t need classic parallelism.
======================================================================
Going Async
Lessons from JavaScript
It is a rare occurrence that a Java developer can say that they learned something from looking at JavaScript, but when it comes to asynchronous programming, JavaScript actually got it right first. As a fundamentally async language, JavaScript has a lot of experience with how painful it can be when badly implemented. It started with callbacks and was later replaced by promises. An important benefit of promises is that it has two “channels”: one for data and one for errors. A JavaScript promise might look something like this:
func .then(f1) .catch(e1) .then(f2) .catch(e2);
So when the original function has a successful result, f1 is called, but if an error was thrown e1 will be called. This might bring it back to the successful track (f2), or result in another error (e2). You can go from data track to error track and back.
The Java version of JavaScript promises is called CompletableFuture.
CompletableFuture
CompletableFuture
implements both the Future
and the CompletionStage
interface. Future
already existed pre-Java8, but it wasn’t very developer-friendly by itself. You could only get the result of the asynchronous computation by using the .get()
method, which blocked the rest (making the async part pretty pointless most of the time) and you needed to implement each possible scenario manually. Adding the CompletionStage
interface was the breakthrough that made asynchronous programming in Java workable.
CompletionStage
is a promise, namely the promise that the computation will eventually be done. It contains a bunch of methods that let you attach callbacks that will be executed on that completion. Now we can handle the result without blocking.
There are two main methods that let you start the asynchronous part of your code: supplyAsync
if you want to do something with the result of the method, and runAsync
if you don’t.
CompletableFuture.runAsync(() → System.out.println("Run async in completable future " + Thread.currentThread())); CompletableFuture.supplyAsync(() → 5);
Callbacks
Now you can add those callbacks to handle the result of your supplyAsync
CompletableFuture.supplyAsync(() → 5) .thenApply(i → i * 3) .thenAccept(i → System.out.println(“The result is “ + i) .thenRun(() → System.out.println("Finished."));
.thenApply
is similar to the .map
function for streams: it performs a transformation. In the example above it takes the result (5) and multiplies it by 3. It will then pass that result (15) further down the pipe.
.thenAccept
performs a method on the result without transforming it. It will also not return a result. Here it will print “The result is 15” to the console. It can be compared to the .foreach
method for streams.
.thenRun
doesn’t use the result of the async operation and also doesn’t return anything, it just waits to call its Runnable
until the previous step is completed.
Asyncing your async
All of the above callback methods also come in an async version: thenRunAsync
, thenApplyAsync
, etc. These versions can run on their own thread and they give you extra control because you can tell it which ForkJoinPool
to use.
If you don’t use the async version, then the callbacks will all be executed on the same thread.
When things go wrong
When something goes wrong, the exceptionally
method is used to handle the exception. You can give it a method that returns a value to get back on the data track, or throw a (new) exception.
… .exceptionally(ex → new Foo()) .thenAccept(this::bar);
Combine and Compose
You can chain multiple CompletableFutures
by using the thenCompose
method. Without it, the result would be nested CompletableFutures
. This makes thenCompose
and thenApply
like flatMap
and map
for streams.
CompletableFuture.supplyAsync(() -> "Hello") .thenCompose(s -> CompletableFuture .supplyAsync(() -> s + "World"));
If you want to combine the result of two CompletableFutures
, you will need a method conveniently called thenCombine
.
future.thenCombine(future2, Integer::sum) .thenAccept(value → System.out.println(value));
As you can see in the example above, the result of the callback in thenCombine
can be handled like a normal CompletableFuture
with all your favourite CompletionStage
methods.
Conclusion
Parallel programming no longer needs to be an insurmountable obstacle in the hunt for faster code. Java 8 makes the process as straightforward as can be, so that any piece of code that could possibly benefit from it, can be pulled, kicking and screaming on all threads, into the multi-core future that is, in fact, just the present day. By which I mean: it’s easy to do, so give it a try and see its advantages for yourself.
Published on Java Code Geeks with permission by Lisa Steendam, partner at our JCG program. See the original article here: Parallel and Asynchronous Programming in Java 8 Opinions expressed by Java Code Geeks contributors are their own. |
Just as a conclusion to your post: I would say, use always sequential stream, only when you need performance improvements then consider using parallel stream, but solution with parallel stream requires testing and awareness of the thread nature. Parallel stream could lead in worst performance in case you do simple calculation on collection of 10 elements, simple because thread nature -> organizing problem into subproblems consumes time and if the job to do is small you will lose more time on this organization instead of actually solving the problem. And yes if you have collection of 1’000’000 it could lead… Read more »