Scala Tutorial – scripting, compiling, main methods, return values of functions
Preface
This is part 10 of tutorials for first-time programmers getting into Scala. Other posts are on this blog, and you can get links to those and other resources on the links page of the Computational Linguistics course I’m creating these for. Additionally you can find this and other tutorial series on the JCG Java Tutorials page.
The tutorials up to this point have been based on working with the Scala REPL or running basic scripts that are run from the command line. The latter is called “scripting” and usually is done for fairly simple, self-contained coding tasks. For more involved tasks that require a number of different modules and accessing libraries produced by others, it is necessary to work with a build system that brings together your code, others’ code, allows you to compile it, test it, and package it so that you can use it as an application.
This tutorial takes you from running Scala scripts to compiling Scala programs to create byte code that can be shared by different applications. This will act as a bridge to set you up for the next step of using a build system. Along the way, some points will be made about objects, extending on some of the ideas from the previous tutorial about object-oriented programming. At a high level, the relevance of objects to a larger, modularized code base should be pretty clear: objects encapsulate data and functions that can be used by other objects, and we need to be able to organize them so that objects know how to find other objects and class definitions. Build systems, which we’ll look at in the next tutorial, will make this straightforward.
Running Scala scripts
In the beginning, you started with the REPL.
scala> println("Hello, World!") Hello, World!
Of course, the REPL is just a (very useful) playground for trying out snippets of Scala code, not for doing real work. So, you saw that you could put code like println(“Hello, World!”) into a file called Hello.scala and run it from the command line.
$ scala Hello.scala Hello, World!
The homeworks and tutorials done so far have worked in this way, though they are a bit more complex. We can even include class definitions and objects created from a class. For example, using the Person class from the previous tutorial, we can put all the code into a file called People.scala (btw, this name doesn’t matter — could as well be Blurglecruncheon.scala).
class Person ( val firstName: String, val lastName: String, val age: Int, val occupation: String ) { def fullName: String = firstName + " " + lastName def greet (formal: Boolean): String = { if (formal) "Hello, my name is " + fullName + ". I'm a " + occupation + "." else "Hi, I'm " + firstName + "!" } } val johnSmith = new Person("John", "Smith", 37, "linguist") val janeDoe = new Person("Jane", "Doe", 34, "computer scientist") val johnDoe = new Person("John", "Doe", 43, "philosopher") val johnBrown = new Person("John", "Brown", 28, "mathematician") val people = List(johnSmith, janeDoe, johnDoe, johnBrown) people.foreach(person => println(person.greet(true)))
This can now be run from the command line, producing the expected result.
$ scala People.scala Hello, my name is John Smith. I'm a linguist. Hello, my name is Jane Doe. I'm a computer scientist. Hello, my name is John Doe. I'm a philosopher. Hello, my name is John Brown. I'm a mathematician.
However, suppose you wanted to use the Person class from a different application (e.g. that is defined in a different file). You might think you could save the following in the file Radiohead.scala, and then run it with Scala.
val thomYorke = new Person("Thom", "Yorke", 43, "musician") val johnnyGreenwood = new Person("Johnny", "Greenwood", 39, "musician") val colinGreenwood = new Person("Colin", "Greenwood", 41, "musician") val edObrien = new Person("Ed", "O'Brien", 42, "musician") val philSelway = new Person("Phil", "Selway", 44, "musician") val radiohead = List(thomYorke, johnnyGreenwood, colinGreenwood, edObrien, philSelway) radiohead.foreach(bandmember => println(bandmember.greet(false)))
However, if you do “scala Radiohead.scala” you’ll see five errors, each one complaining that the type Person wasn’t found. How could Radiohead.scala know about the Person class and where to find its definition? I’m not aware of a way to do this with scripting-style Scala programming, and even though I suspect there may be a way to do something this simple, I don’t even care to know it. Let’s just get straight to compiling.
Compiling
The usual thing we do with Scala is to compile our programs to byte code. We won’t go into the details of that, but it basically means that Scala turns the text of a Scala program into a compiled set of machine instructions that can be interpreted by your operating system. (It actually compiles to Java byte code, which is one reason it is pretty straightforward to use Java code when coding in Scala.)
So, what does compilation look like? We need to start by changing the code we did above a bit. Make a directory that has nothing in it, say /tmp/tutorial. Then save the following as PersonApp.scala in that directory.
class Person ( val firstName: String, val lastName: String, val age: Int, val occupation: String ) { def fullName: String = firstName + " " + lastName def greet (formal: Boolean): String = { if (formal) "Hello, my name is " + fullName + ". I'm a " + occupation + "." else "Hi, I'm " + firstName + "!" } } object PersonApp { def main (args: Array[String]) { val johnSmith = new Person("John", "Smith", 37, "linguist") val janeDoe = new Person("Jane", "Doe", 34, "computer scientist") val johnDoe = new Person("John", "Doe", 43, "philosopher") val johnBrown = new Person("John", "Brown", 28, "mathematician") val people = List(johnSmith, janeDoe, johnDoe, johnBrown) people.foreach(person => println(person.greet(true))) } }
Notice that the code looks pretty similar to the script above, but now we have a PersonApp object with a main method. The main method contains all the stuff that the original script had after the Person definition. Notice also that there is an args argument to the main method, which should look familiar now. What you are seeing is that a Scala script is basically just a simplified view of an object with a main method. Such scripts use the convention that the Array[String] provided to the method is called args.
Okay, so now consider what happens if you run “scala PersonApp.scala” — nothing at all. That’s because there is no executable code available outside of the object and class definitions. Instead, we need to compile the code and then run the main method of specific objects. The next step is to run scalac (N.B. “scalac” with a “c”, not “scala”) on PersonApp.scala. The name scalac is short for Scala compiler. Do the following steps in the /tmp/tutorial directory.
$ scalac PersonApp.scala $ ls Person.class PersonApp.class PersonApp$$anonfun$main$1.class PersonApp.scala PersonApp$.class
Notice that a number of *.class files have been generated. These are byte code files that the scala application is able to run. A nice thing here is that it all the compilation is done: when in the past you ran “scala” on your programs (scripts), it had to first compile the instructions and then run the program. Now we are separating these steps into a compilation phase and a running phase.
Having generated the class files, we can run any object that has a main method, like PersonApp.
$ scala PersonApp Hello, my name is John Smith. I'm a linguist. Hello, my name is Jane Doe. I'm a computer scientist. Hello, my name is John Doe. I'm a philosopher. Hello, my name is John Brown. I'm a mathematician.
Try running “scala Person” to see the error message it gives you.
Next, move the Radiohead.scala script that you saved earlier into this directory and run it.
$ scala Radiohead.scala Hi, I'm Thom! Hi, I'm Johnny! Hi, I'm Colin! Hi, I'm Ed! Hi, I'm Phil!
This is the same script, but now it is in a directory that contains the Person.class file, which tells Scala everything that Radiohead.scala needs to construct objects of the Person class. Scala makes available any class file that is defined in the CLASSPATH, an environment variable that by default includes the current working directory.
Despite this success, we’re going away from script land with this post, so change the contents of Radiohead.scala to be the following.
object RadioheadGreeting { def main (args: Array[String]) { val thomYorke = new Person("Thom", "Yorke", 43, "musician") val johnnyGreenwood = new Person("Johnny", "Greenwood", 39, "musician") val colinGreenwood = new Person("Colin", "Greenwood", 41, "musician") val edObrien = new Person("Ed", "O'Brien", 42, "musician") val philSelway = new Person("Phil", "Selway", 44, "musician") val radiohead = List(thomYorke, johnnyGreenwood, colinGreenwood, edObrien, philSelway) radiohead.foreach(bandmember => println(bandmember.greet(false))) } }
Then run scalac on all of the *.scala files in the directory. There are now more class files, corresponding to the RadioheadGreeting object we defined.
$ scalac *.scala $ ls Person.class Radiohead.scala PersonApp$$anonfun$main$1.class RadioheadGreeting$$anonfun$main$1.class PersonApp$.class RadioheadGreeting$.class PersonApp.class RadioheadGreeting.class PersonApp.scala
You can now run “scala RadioheadGreeting” to get the greeting from the band members. Notice that the file RadioheadGreeting was saved in was called Radiohead.scala and that no class files were generated called Radiohead.class, etc. Again, the file name could have been named something entirely different, like Turlingdrome.scala. (Embrace your inner Vogon.)
Multiple objects in the same file
There is no problem having multiple objects with main methods in the same file. When you compile the file with scalac, each object generates its own set of class files, and you call scala on whichever class file contains the definition for the main method you want to run. As an example, save the following as Greetings.scala.
object Hello { def main (args: Array[String]) { println("Hello, world!") } } object Goodbye { def main (args: Array[String]) { println("Goodbye, world!") } } object SayIt { def main (args: Array[String]) { args.foreach(println) } }
Next compile the file and then you can run any of the generated class files (since they all have main methods).
$ scalac Greetings.scala $ scala Hello Hello, world! $ scala Goodbye Goodbye, world! $ scala Goodbye many useless arguments Goodbye, world! $ scala SayIt "Oh freddled gruntbuggly" "thy micturations are to me" "As plurdled gabbleblotchits on a lurgid bee." Oh freddled gruntbuggly thy micturations are to me As plurdled gabbleblotchits on a lurgid bee.
In case you missed it earlier, the args array is where the command line arguments go and you can thus make use of them (or not, as in the case of the Hello and Goodbye objects).
Functions with return values versus those without
Some functions return a value while others do not. As a simple example, consider the following pairs of functions.
scala> def plusOne (x: Int) = x+1 plusOne: (x: Int)Int scala> def printPlusOne (x: Int) = println(x+1) printPlusOne: (x: Int)Unit
The first takes an Int argument and returns an Int, which is a value. The other takes an Int and returns Unit, which is to say it doesn’t return a value. Notice the difference in behavior between the two following uses of the functions.
scala> val foo = plusOne(2) foo: Int = 3 scala> val bar = printPlusOne(2) 3 bar: Unit = ()
Scala uses a slightly subtle distinction in function definitions that can distinguish functions that return values versus those that return Unit (no value): If you don’t use an equals sign in the definition, it means that the function returns Unit.
scala> def plusOneNoEquals (x: Int) { x+1 } plusOneNoEquals: (x: Int)Unit scala> def printPlusOneNoEquals (x: Int) { println(x+1) } printPlusOneNoEquals: (x: Int)Unit
Notice that the above definition of plusOneNoEquals returns Unit, even though it looks almost identical to plusOne defined earlier. Check it out.
scala> val foo = plusOneNoEquals(2) foo: Unit = ()
Now look back at the main methods given earlier. No equals. Yep, they don’t have a return value. They are the entry point into your code, and any effects of running the code must be output to the console (e.g. with println or via a GUI) or written to the file system (or the internet somewhere). The outputs of such functions (ones which do not return a value) are called side-effects. You need them for the main methods. However, in many styles of programming, a great deal of work is done with side-effects. I’ve been trying to gently lead the readers of this tutorial to adopt a more functional approach that tries to avoid them. I’ve found it a more effective style myself in my own coding, so I’m hoping it will serve you all better to start from that point. (Note that Scala supports many styles of programming, which is nice because you have choice and can go with what you find most suitable.)
Cleaning up
You may have noticed that the directory you are working in as you run scalac on your scala files becomes quite littered with class files. For example, here’s what the state of the code directory worked with in this tutorial looks like after compiling all files.
$ ls Goodbye$.class PersonApp.scala Goodbye.class Radiohead.scala Greetings.scala RadioheadGreeting$$anonfun$main$1.class Hello$.class RadioheadGreeting$.class Hello.class RadioheadGreeting.class Person.class SayIt$$anonfun$main$1.class PersonApp$$anonfun$main$1.class SayIt$.class PersonApp$.class SayIt.class PersonApp.class
A mess, right? Generally, one would rarely develop a Scala application by compiling it directly in this way. Instead a build system is used to manage the compilation process, organize the files, and allow one to easily access software libraries created by other developers. The next tutorial will cover this, using SBT (the Simple Build Tool).
Reference: First steps in Scala for beginning programmers, Part 10 from our JCG partner Jason Baldridge at the Bcomposes blog.
Related Articles :
- Scala Tutorial – Scala REPL, expressions, variables, basic types, simple functions, saving and running programs, comments
- Scala Tutorial – Tuples, Lists, methods on Lists and Strings
- Scala Tutorial – conditional execution with if-else blocks and matching
- Scala Tutorial – iteration, for expressions, yield, map, filter, count
- Scala Tutorial – regular expressions, matching
- Scala Tutorial – regular expressions, matching and substitutions with the scala.util.matching API
- Scala Tutorial – Maps, Sets, groupBy, Options, flatten, flatMap
- Scala Tutorial – scala.io.Source, accessing files, flatMap, mutable Maps
- Scala Tutorial – objects, classes, inheritance, traits, Lists with multiple related types, apply
- Scala Tutorial – SBT, scalabha, packages, build systems
- Scala Tutorial – code blocks, coding style, closures, scala documentation project
- Fun with function composition in Scala
- How Scala changed the way I think about my Java Code
- Testing with Scala
- Things Every Programmer Should Know