rust-book-cn/nostarch/chapter03.md
Carol (Nichols || Goulding) 29678e48ad Remove "distinct" from tuple explanation
Fixes #283.
2016-11-01 10:11:54 -04:00

1449 lines
46 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

[TOC]
# Common Programming Concepts in Rust
Lets first look at concepts that appear in almost every programming language
and see how they work in Rust. Many programming languages have much in common
at their core. None of the concepts presented in this chapter are unique to
Rust, but well discuss Rusts particular syntax and conventions concerning
these common concepts.
Specifically, well be talking about variables, basic types, functions,
comments, and control flow. These foundations will be in every Rust
program, and learning them early will give you a strong core to start from.
PROD: START BOX
Keep in mind as we get into variables and functions that the Rust language has
a set of *keywords* that have been reserved for use by the language only, much
like other languages do. This means you cannot use these words as names of
variables or functions, for example. Most of these have special meaning and we
will be using them to do various things in our Rust programs; a few have no
current functionality associated but have been reserved for functionality that
might be in the Rust language in the future. You can find a list of the
keywords in Appendix XX.
PROD: END BOX
## Variables and Mutability
We mentioned in Chapter 2 that by default, variables are *immutable*.
This is one of many nudges in Rust that encourages us to write our code in a
way that gets the most of the safety and easy concurrency that Rust has to
offer. We still have the option to make our variables mutable, though. Lets
explore how and why Rust encourages us to favor immutability, and why we might
want to opt out of that.
Variables being immutable means once a value is bound, you cant
change that value. To illustrate this, lets generate a new project in your
*projects* directory called `variables` by using `cargo new --bin variables`.
Then, in your new *variables* directory, open *src/main.rs* and replace its code
with the following:
Filename: src/main.rs
```rust,ignore
fn main() {
let x = 5;
println!("The value of x is: {}", x);
x = 6;
println!("The value of x is: {}", x);
}
```
Save and run the program using `cargo run`, and you should receive an error
message, as in this output:
```bash
$ cargo run
Compiling variables v0.0.1 (file:///projects/variables)
error: re-assignment of immutable variable `x` [--explain E0384]
--> src/main.rs:4:5
4 |> x = 6;
|> ^^^^^
note: prior assignment occurs here
--> src/main.rs:2:9
2 |> let x = 5;
|> ^
```
This is our first example of the compiler helping us find an error in our
program! Compiler errors can be frustrating. Keep in mind that they only mean
your program isnt safely doing what you want it to do yet; they do *not* mean
that youre not a good programmer! Experienced Rustaceans still get compiler
errors. The Rust compiler is trying to help your program be the very best.
PROD: START BOX
#### Extended Error Explanations
Now that youve seen a Rust error, lets take a moment to look at one
particularly useful aspect of errors. Rust encourages you to seek further
information on the kind of error youve received with output like this:
```bash
error: re-assignment of immutable variable `x` [--explain E0384]
```
This tells us that if we pass the `--explain` flag to `rustc` with the
provided error code, we can see an extended explanation which will try to
explain common causes of and solutions to that kind of error. Not every error
has a longer explanation, but many do. Heres a portion of the explanation
for the `E0384` error we received:
````bash
$ rustc --explain E0384
This error occurs when an attempt is made to reassign an immutable variable.
For example:
```
fn main(){
let x = 3;
x = 5; // error, reassignment of immutable variable
}
```
````
These explanations can really help if youre stuck on an error, so dont
hesitate to look up the error code. The compiler is your friend, and its
there to help.
PROD: END BOX
The error tells us that the cause of the error is `re-assignment of immutable
variable`, because we tried to assign a second value to the immutable `x`
variable.
Its important that we get compile-time errors when we attempt to change a
value that we previously said was immutable because this very situation can
lead to bugs. If one part of our code operates on an assumption that a value
will never change, and another part of our code changes that value, its
possible that the first part of the code wont do what it was designed to do.
This cause of bugs can be difficult to track down after the fact, especially
when the second piece of code only changes the value *sometimes*.
In Rust, we can trust that a value we say wont change really wont change,
because the compiler is enforcing that guarantee for us. When reading and
writing code, we dont have to keep track in our head how and where a value
might change. This can make code easier to reason about.
Mutability can be really useful, though! Variables are immutable only by
default; you can make them mutable by adding `mut` in front of the variable
name. In addition to allowing this value to be changed, it conveys intent to
future readers of the code by indicating that other parts of the code will be
changing this value.
For example, change the program you just wrote to the following:
Filename: src/main.rs
```rust
fn main() {
let mut x = 5;
println!("The value of x is: {}", x);
x = 6;
println!("The value of x is: {}", x);
}
```
Running this, we get:
```bash
$ cargo run
Compiling variables v0.1.0 (file:///projects/variables)
Running `target/debug/variables`
The value of x is: 5
The value of x is: 6
```
Using `mut`, we are allowed to change the value that `x` binds to from `5` to
`6`. In some cases youll want to make a variable mutable because it makes the
code easier to understand than an implementation that only uses immutable
variables. In cases where youre using large data structures, mutating an
instance in place may be faster than copying and returning newly allocated
instances. It all depends on the tradeoffs you want to make in your situation.
### Shadowing
As we saw in the guessing game tutorial, we can declare new variables with the
same name as a previous variable, and the new variable *shadows* the previous
variable. We say that the first variable is *shadowed* by the second, which means
that the second variables value is what you will see when you use the variable.
We can shadow a variable by using the same variables name and repeating the use
of the `let` keyword as follows:
Filename: src/main.rs
```rust
fn main() {
let x = 5;
let x = x + 1;
let x = x * 2;
println!("The value of x is: {}", x);
}
```
This program first binds `x` to a value of `5`. Then, it shadows `x` by
repeating `let x =`, taking the original value and adding `1` so that the value
of `x` is then `6`. The third `let` statement also shadows `x`, taking the
previous value and multiplying it by `2` to give `x` a final value of `12`. If
you run this, it will output:
```bash
$ cargo run
Compiling variables v0.1.0 (file:///projects/variables)
Running `target/debug/variables`
The value of x is: 12
```
This is different from marking a variable as `mut` because unless we use the
`let` keyword again, well get a compile-time error if we accidentally try to
reassign to this variable. We can perform a few transformations on a value, but
have the variable be immutable after those transformations have been completed.
The other difference between `mut` and shadowing is that, since were
effectively creating a new variable when we use the `let` keyword again, we can
change the type of the value, but reuse the same name. For
example, say we ask a user to show us how many spaces they want between some
text by sending us space characters, but we really want to store that as a
number:
```rust
let spaces = " ";
let spaces = spaces.len();
```
This is allowed: the first `spaces` variable is a string type, and the second
`spaces` variable, which is a brand new variable that happens to have the same
name as the first one, is a number type. Shadowing thus saves us from having to
come up with different names like `spaces_str` and `spaces_num`; we can reuse
the simpler `spaces` name. If we try to use `mut` for this, however, like this:
```rust,ignore
let mut spaces = " ";
spaces = spaces.len();
```
We will get a compile-time error because we are not allowed to mutate a
variables type:
```bash
error: mismatched types [--explain E0308]
-->
|>
4 |> spaces = spaces.len();
|> ^^^^^^^^^^^^ expected &-ptr, found usize
note: expected type `&str`
note: found type `usize`
error: aborting due to previous error
```
Now that weve explored how variables work, lets look at some more
data types they can have.
## Data Types
Every value in Rust is of a certain *type*, which tells Rust what kind of data
is being given so it knows how to work with that data. In this section, well
look at a number of types built into the language itself. We split the types
into two subsets: scalar and compound.
Something to keep in mind throughout this section: Rust is a *statically typed*
language, which means that it must know the types of all variables at compile
time. The compiler can usually infer what type we want to use based on the
value and how we use it. In cases when many types are possible, such as when we
converted a `String` to a numeric type using `parse` in Chapter 2, we must
add a type annotation, like this:
```rust
let guess: u32 = "42".parse().unwrap();
```
If we dont put the type annotation here, Rust will give us this error that
means the compiler needs more information from us to know which possible type
we want:
```bash
error: unable to infer enough type information about `_`; type annotations or
generic parameter binding required [--explain E0282]
-->
|>
3 |> let guess = "42".parse().unwrap();
|> ^^^^^
```
You will see some type annotations as we discuss the various data types.
### Scalar Types
A *scalar* type represents a single value. There are four primary scalar
types in Rust: integers, floating point numbers, booleans, and characters.
Youll likely recognize these from other programming languages, but lets jump
into how they work in Rust.
#### Integer Types
An *integer* is a number without a fractional component. Weve used one integer
type already in this chapter, the `i32` type. This type declaration indicates
that the value its associated with should be a signed integer (hence the `i`,
as opposed to a `u` for unsigned) for a 32-bit system. There are a number of
built-in integer types in Rust, shown in Table 3-1.
| Length | signed | unsigned |
|--------|--------|----------|
| 8-bit | i8 | u8 |
| 16-bit | i16 | u16 |
| 32-bit | i32 | u32 |
| 64-bit | i64 | u64 |
| arch | isize | usize |
*Table 3-1: Integer types in Rust. Each variant in the signed and unsigned
columns (for example, `i32`) can be used to declare the type of an integer
value.*
Each variant can be either signed or unsigned and has an explicit size. Signed
and unsigned merely refers to whether it is possible for the number to be
either negative or positive; in other words, whether the number needs to have a
sign with it (signed), or whether it will only ever be positive and can
therefore be represented without a sign (unsigned). Its like writing numbers
on paper: when the sign matters, a number is shown with a plus sign or minus
sign, but when its safe to assume the number is positive, its shown with no
sign. Signed numbers are stored using twos complement representation (if
youre unsure what this is you can search for it online; an explanation is
outside the scope of this text).
Each signed variant can store numbers from -(2<sup>n - 1</sup>) to 2<sup>n -
1</sup> - 1 inclusive, where `n` is the number of bits that variant uses. So an
`i8` can store from -(2<sup>7</sup>) to 2<sup>7</sup> - 1, which equals -128
to 127. Unsigned variants can store numbers from 0 to 2<sup>n</sup> - 1, so a
`u8` can store from 0 to 2<sup>8</sup> - 1, which equals 0 to 255.
Finally, the `isize` and `usize` types depend on the kind of computer your
program is running on: 64-bits if youre on a 64-bit architecture, and 32-bits
if youre on a 32-bit architecture.
You can write integer literals in any of the forms shown in Table 3-2. Note that
all number literals except for the byte literal allow a type suffix, such as
`57u8`, and `_` as a visual separator, such as `1_000`.
| Number literals | Example |
|------------------|---------------|
| Decimal | `98_222` |
| Hex | `0xff` |
| Octal | `0o77` |
| Binary | `0b1111_0000` |
| Byte (`u8` only) | `b'A'` |
*Table 3-2: Integer literals in Rust.*
So how do you know which type of integer to use? If youre unsure, Rusts
defaults are generally good choices, and integer types default to `i32`: its
generally the fastest, even on 64-bit systems. The primary situation in which
youd use `isize` or `usize` is when indexing some sort of collection.
#### Floating-Point Types
Rust also has two primitive types for *floating-point numbers*, which are
numbers with decimal points. Rusts floating-point types are `f32` and `f64`,
which are 32 bits and 64 bits in size, respectively. The default type is `f64`,
as its roughly the same speed as `f32`, but has a larger precision. It is
possible to use an `f64` on 32 bit systems, but it will be slower than using an
`f32` on those systems. Most of the time, trading potential worse performance
for better precision is a reasonable initial choice, and you should benchmark
your code if you suspect floating-point size is a problem in your case. See
Chapter XX for how to run benchmarks.
Heres an example showing floating-point numbers in action:
Filename: src/main.rs
```rust
fn main() {
let x = 2.0; // f64
let y: f32 = 3.0; // f32
}
```
Floating-point numbers are represented according to the IEEE-754 standard. The
`f32` type is a single-precision float, while `f64` has double-precision.
#### Numeric Operations
Rust supports the usual basic mathematic operations youd expect for all of
these number types: addition, subtraction, multiplication, division, and
remainder. This code shows how youd use each one in a `let` statement:
Filename: src/main.rs
```rust
fn main() {
// addition
let sum = 5 + 10;
// subtraction
let difference = 95.5 - 4.3;
// multiplication
let product = 4 * 30;
// division
let quotient = 56.7 / 32.2;
// remainder
let remainder = 43 % 5;
}
```
Each expression in these statements uses a mathematical operator and evaluates
to a single value, which is then bound to a variable. Appendix XX contains a
list of all operators that Rust provides.
#### The Boolean Type
As in most other programming languages, a boolean type in Rust has two possible
values: `true` and `false`. The boolean type in Rust is specified with `bool`.
For example:
Filename: src/main.rs
```rust
fn main() {
let t = true;
let f: bool = false; // with explicit type annotation
}
```
The main way to consume boolean values is through conditionals like an `if`
statement. Well cover how `if` statements work in Rust in the “Control Flow”
section of this chapter.
#### The Character Type
So far weve only worked with numbers, but Rust supports letters too. Rusts
`char` type is the languages most primitive alphabetic type, and this code
shows one way to use it:
Filename: src/main.rs
```rust
fn main() {
let c = 'z';
let z = '';
let heart_eyed_cat = '😻';
}
```
Rusts `char` represents a Unicode Scalar Value, which means that it can
represent a lot more than just ASCII. Accented letters, Chinese/Japanese/Korean
ideographs, emoji, and zero width spaces are all valid `char`s in Rust. Unicode
Scalar Values range from `U+0000` to `U+D7FF` and `U+E000` to `U+10FFFF`
inclusive. A “character” isnt really a concept in Unicode, however, so your
human intuition for what a “character” is may not match up with what a `char`
is in Rust. Well discuss this in detail in the Strings section of Chapter 8.
### Compound Types
*Compound types* can group multiple values of other types into one type. Rust
has two primitive compound types: tuples and arrays.
#### Grouping Values into Tuples
A tuple is a general way of grouping together some number of other values with
a variety of types into one compound type.
We create a tuple by writing a comma-separated list of values inside
parentheses. Each position in the tuple has a type, and the types of the
different values in the tuple dont have to be the same. Weve added optional
type annotations in this example:
Filename: src/main.rs
```rust
fn main() {
let tup: (i32, f64, u8) = (500, 6.4, 1);
}
```
The variable `tup` binds to the entire tuple, since a tuple is considered a
single compound element. To get the individual values out of a tuple, we can
use pattern matching to destructure a tuple value, like this:
Filename: src/main.rs
```rust
fn main() {
let tup = (500, 6.4, 1);
let (x, y, z) = tup;
println!("The value of y is: {}", y);
}
```
In this program, we first create a tuple and bind it to the variable `tup`. We
then use a pattern with `let` to take `tup` and turn it into three separate
variables, `x`, `y`, and `z`. This is called *destructuring*, because it breaks
the single tuple into three parts. Finally, we print the value of `y`, which is
`6.4`.
#### Tuple Indexing
In addition to destructuring through pattern matching, we can also access a
tuple element directly by using a period (`.`) followed by the index of the
value we want to access. For example:
Filename: src/main.rs
```rust
fn main() {
let x: (i32, f64, u8) = (500, 6.4, 1);
let five_hundred = x.0;
let six_point_four = x.1;
let one = x.2;
}
```
This program creates a tuple, `x`, and then makes new variables for each element
by using their index. As with most programming languages, the first index in a
tuple is 0.
### Arrays
Another way to have a collection of multiple values is with an
*array*. Unlike a tuple, every element of an array must have the same type.
Arrays in Rust are different than arrays in some other languages because arrays
in Rust have a fixed length: once declared, they cannot grow or shrink in size.
In Rust, the values going into an array are written as a comma separated list
inside square brackets:
Filename: src/main.rs
```rust
fn main() {
let a = [1, 2, 3, 4, 5];
}
```
While arrays can be useful since they are a primitive type so using them can be
very fast, they arent as flexible as the vector type. The vector type is a
similar collection type provided by the standard library that *is* allowed to
grow or shrink in size. If youre unsure whether to use an array or a vector,
you should probably go with a vector, and well discuss them in more detail in
Chapter 8.
An example of when we might want to use an array is storing the months of the
year. Its very unlikely that our program will need to add or remove months, so
we can use an array since we know we will always have 12 items:
```rust
let months = ["January", "February", "March", "April", "May", "June", "July",
"August", "September", "October", "November", "December"];
```
#### Accessing Array Elements
An array is a single chunk of memory, allocated on the stack. We can access
elements of an array using indexing, like this:
Filename: src/main.rs
```rust
fn main() {
let a = [1, 2, 3, 4, 5];
let first = a[0];
let second = a[1];
}
```
In this example, the variable named `first` will get the value `1`, since that
is the value at index `[0]` in the array. The variable named `second` will get
the value `2` from index `[1]` in the array.
#### Invalid Array Element Access
What happens if you try to access an element of an array past the end of the
array? Say we changed our program to:
Filename: src/main.rs
```rust,ignore
fn main() {
let a = [1, 2, 3, 4, 5];
let element = a[10];
println!("The value of element is: {}", element);
}
```
Running this code with `cargo run` produces:
```bash
$ cargo run
Compiling arrays v0.1.0 (file:///projects/arrays)
Running `target/debug/arrays`
thread '<main>' panicked at 'index out of bounds: the len is 5 but the index is
10', src/main.rs:4
note: Run with `RUST_BACKTRACE=1` for a backtrace.
error: Process didn't exit successfully: `target/debug/arrays` (exit code: 101)
```
We can see that the compilation did not give us any errors, but we got a
*runtime* error and our program didnt exit successfully. When we attempt to
access an element using indexing, Rust will check that the index weve
specified is less than the array length. If the index is greater than the
length, it will *panic*, which is what its called when a Rust program exits
with an error.
This is our first example of Rusts safety principles in action. In many
low-level languages, this kind of check is not done, and when you provide an
incorrect index, invalid memory can be accessed. Rust protects us against this
kind of error by immediately exiting instead of allowing the memory access and
continuing. Well discuss more of Rusts error handling in Chapter XX.
## How Functions Work
Functions are pervasive in Rust code. Weve already seen one of the most
important functions in the language: the `main` function thats the entry
point of many programs. Weve also seen the `fn` keyword, which allows us to
declare new functions.
Rust code uses *snake case* as the conventional style for function and variable
names. In snake case, all letters are lower case, and there are underscores
separating words. Heres a program containing an example function definition:
Filename: src/main.rs
```rust
fn main() {
println!("Hello, world!");
another_function();
}
fn another_function() {
println!("Another function.");
}
```
Function definitions in Rust start with `fn` and have a set of parentheses
after the function name. The curly braces tell the compiler where the function
body begins and ends.
We can call any function weve defined by entering its name followed by a pair
of parentheses. Since `another_function` is defined in the program, it can be
called from inside the `main` function. Note that we defined
`another_function` *after* the `main` function in our source code; we could
have defined it before as well. Rust doesnt care where you define your
functions, only that they are defined somewhere.
Lets start a new binary project named `functions` so that we can explore
further. Place the `another_function` example in *src/main.rs* and run it.
You should see the following output:
```bash
$ cargo run
Compiling functions v0.1.0 (file:///projects/functions)
Running `target/debug/functions`
Hello, world!
Another function.
```
The lines execute in the order they appear in the `main` function. First, our
“Hello, world!” message prints, and then `another_function` is called and its
message is printed.
### Function Arguments
Functions can also take arguments. The following rewritten version of
`another_function` shows what arguments look like in Rust:
Filename: src/main.rs
```rust
fn main() {
another_function(5);
}
fn another_function(x: i32) {
println!("The value of x is: {}", x);
}
```
Try running this program, and you should get this output:
```bash
$ cargo run
Compiling functions v0.1.0 (file:///projects/functions)
Running `target/debug/functions`
The value of x is: 5
```
In the declaration of `another_function`, we place one argument named `x`. We
specify the type of `x` as `i32`. When we pass `5` to `another_function`,
the `println!` macro puts `5` where the pair of curly braces were in the format
string.
In function signatures, we *must* declare the type. This is a deliberate
decision in the design of Rust; requiring type annotations in function
definitions means the compiler almost never needs you to use them elsewhere in
the code in order to figure out what you mean.
When you want a function to have multiple arguments, just separate them inside
the function signature with commas, like this:
Filename: src/main.rs
```rust
fn main() {
another_function(5, 6);
}
fn another_function(x: i32, y: i32) {
println!("The value of x is: {}", x);
println!("The value of y is: {}", y);
}
```
In this example, we make a function with two arguments, both of which are
`i32`s. If your function has multiple arguments, they dont need to be the same
type, but they just happen to be in this example. Our function then prints out
the values of both of its arguments.
Lets try out this code. Replace the program currently in your `function`
projects `main.rs` file with the example above, and run it as follows:
```bash
$ cargo run
Compiling functions v0.1.0 (file:///projects/functions)
Running `target/debug/functions`
The value of x is: 5
The value of y is: 6
```
Since `5` is passed as the `x` argument and `6` is passed as the `y` argument,
the two strings are printed with these values.
### Function Bodies
Function bodies are made up of a series of statements optionally ending in an
expression. So far, weve only seen functions without an ending expression, but
we have seen expressions as parts of statements. Since Rust is an
expression-based language, this is an important distinction to understand.
Other languages dont have the same distinctions, so lets look at what
statements and expressions are and how their differences affect the bodies of
functions.
#### Statements and Expressions
Weve already been using both statements and expressions. *Statements* are
instructions that perform some action and do not return a value. *Expressions*
evaluate to a resulting value. Lets look at some examples.
Creating a variable and assigning a value to it with the `let` keyword
is a statement. In this example, `let y = 6;` is a statement:
Filename: src/main.rs
```rust
fn main() {
let y = 6;
}
```
Function definitions are also statements; the entire previous example is a
statement in itself.
Statements do not return values themselves. Therefore, you cant assign a `let`
statement to another variable, as this code tries to do:
Filename: src/main.rs
```rust,ignore
fn main() {
let x = (let y = 6);
}
```
If we were to run this program, wed get an error like this:
```bash
$ cargo run
Compiling functions v0.1.0 (file:///projects/functions)
error: expected expression, found statement (`let`)
--> src/main.rs:2:14
2 |> let x = (let y = 6);
|> ^^^
note: variable declaration using `let` is a statement
error: aborting due to previous error
error: Could not compile `functions`.
```
The `let y = 6` statement does not return a value, so there isnt anything for
`x` to bind to. This is different than in other languages like C and Ruby where
the assignment returns the value of the assignment. In those languages, we can
write `x = y = 6` and have both `x` and `y` have the value `6`; that is not the
case in Rust.
Expressions are code that evaluate to something, and make up most of the rest
of the code that you will write in Rust. Consider a simple math operation, like
this:
```rust,ignore
5 + 6
```
This is an expression, and evaluating it results in the value `11`. Expressions
can be part of statements-- in the previous example that had the statement `let
y = 6;`, `6` is an expression that evaluates to the value `6`. Calling a
function is an expression. Calling a macro is an expression. The block that we
use to create new scopes, `{}`, is an expression, for example:
Filename: src/main.rs
```rust
fn main() {
let x = 5;
let y = {
let x = 3;
x + 1
};
println!("The value of y is: {}", y);
}
```
<!-- If we use wingding numbers to call out code, we might delete the
repetition here and just use those numbers--that can help the flow of the text.
Im flagging this as a reminder for when we transfer to libreoffice -->
The expression:
```rust,ignore
{
let x = 3;
x + 1
}
```
is a block that, in this case, evaluates to `4`, and then gets bound to
`y` as part of the `let` statement.
Note that the line containing `x + 1` does not have a semicolon at the end,
unlike most of the lines weve seen up until now. This is the most important
distinction between expressions and statements to remember: statements end in
semicolons while expressions do not. If you add a semicolon to the end of an
expression, that will turn it into a statement, which will then not return a
value. Keep this in mind as we explore function return values and expressions.
### Functions with Return Values
Functions can return values back to the code that calls them. We dont name
return values, but we do declare their type, after an arrow (`->`). In Rust,
the “return value of the function” is synonymous with the “value of the final
expression in the block of the body of a function.” Heres an example of a
function that returns a value:
Filename: src/main.rs
```rust
fn five() -> i32 {
5
}
fn main() {
let x = five();
println!("The value of x is: {}", x);
}
```
There are no function calls, macros, or even `let` statements in the `five`
function: just the number `5` by itself. Thats a perfectly valid function in
Rust. Note the functions return type is specified, too, as `-> i32`. Try
running this code, and the output should look like this:
```bash
$ cargo run
Compiling functions v0.1.0 (file:///projects/functions)
Running `target/debug/functions`
The value of x is: 5
```
The `5` in `five` is the functions return value, which is why the return type
is `i32`. Lets examine this in more detail. There are two important bits.
First, the line `let x = five();` shows us using the return value of a function
to initialize a variable.
Because the function `five` returns a `5`, that line is the same as saying:
```rust
let x = 5;
```
The second interesting bit is the `five` function itself. It requires no
arguments and defines the type of the return value, but the body of the
function is a lonely `5` with no semicolon because it is an expression whose
value we want to return. Lets look at another example:
Filename: src/main.rs
```rust
fn main() {
let x = plus_one(5);
println!("The value of x is: {}", x);
}
fn plus_one(x: i32) -> i32 {
x + 1
}
```
Running this code will print `The value of x is: 6`. What happens if we put a
semicolon at the end of the line containing `x + 1`, changing it from an
expression to a statement?
Filename: src/main.rs
```rust,ignore
fn main() {
let x = plus_one(5);
println!("The value of x is: {}", x);
}
fn plus_one(x: i32) -> i32 {
x + 1;
}
```
Running this code gives an error, as follows:
```bash
$ cargo run
Compiling functions v0.1.0 (file:///projects/functions)
error: not all control paths return a value [--explain E0269]
--> src/main.rs:7:1
7 |> fn plus_one(x: i32) -> i32 {
|> ^
help: consider removing this semicolon:
--> src/main.rs:8:10
8 |> x + 1;
|> ^
error: aborting due to previous error
error: Could not compile `functions`.
```
The main error message, “not all control paths return a value”, reveals the
core of the issue with this code. The definition of the function `plus_one`
says that it will return an `i32`, but statements dont evaluate to a value.
Therefore, nothing is returned, which contradicts the function definition and
results in an error. In this output, Rust gives an option to rectify this: it
suggests removing the semicolon, which would fix the error.
## Comments
All programmers strive to make their code easy to understand, but sometimes
extra explanation is warranted. In these cases, we leave notes in our source
code that the compiler will ignore but people reading the source code may find
useful. These notes are called *comments*.
Heres a simple comment:
```rust
// Hello, world.
```
In Rust, comments must start with two slashes and will last until the end of
the line. For comments that extend beyond a single line, youll need to
include `//` on each line, like this:
```rust
// So were doing something complicated here, long enough that we need
// multiple lines of comments to do it! Whew! Hopefully, this comment will
// explain whats going on.
```
Comments can also be placed at the end of lines of code:
Filename: src/main.rs
```rust
fn main() {
let lucky_number = 7; // Im feeling lucky today.
}
```
But youll more often see them above, like so:
Filename: src/main.rs
```rust
fn main() {
// Im feeling lucky today.
let lucky_number = 7;
}
```
Thats all there is to it. Comments are not particularly complicated.
## Control Flow
Decisions on whether or not to run some code depending on if a condition is
true, or deciding to run some code repeatedly while a condition is true, are
basic building blocks in most programming languages. The most common constructs
that let us control the flow of execution of our Rust code are `if` expressions
and loops.
### `if` Expressions
An `if` expression allows us to branch our code depending on conditions. We
provide a condition and then say, “If this condition is met, run this
block of code. If the condition is not met, do not run this block of code.”
Lets make a new project to explore `if`, called `branches`. In *src/main.rs*,
put:
Filename: src/main.rs
```rust
fn main() {
let number = 3;
if number < 5 {
println!("condition was true");
} else {
println!("condition was false");
}
}
```
All `if` expressions start with the keyword `if`, which is followed by a
condition. In this case, our condition is checking if our variable
`number` has a value that is less than 5. The block of code we want to execute
if the condition is true goes immediately after the condition, inside curly
braces. These blocks are sometimes called *arms*. We can optionally also
include an `else` expression, which we have chosen to do here. This gives the
program an alternative block of code to execute should the condition evaluate
to false. If you dont give an `else` expression and the condition is false,
the program will just skip the `if` block and move on to the next bit of code.
Try running this code, and you should see output like this:
```bash
$ cargo run
Compiling branches v0.1.0 (file:///projects/branches)
Running `target/debug/branches`
condition was true
```
Lets try changing the value of `number` to a value that makes the condition
`false` to see what happens:
```rust,ignore
let number = 7;
```
Run the program again, and look at the output:
```bash
$ cargo run
Compiling branches v0.1.0 (file:///projects/branches)
Running `target/debug/branches`
condition was false
```
Its also worth noting that the condition here *must* be a `bool`. To see what
happens if the condition isnt a `bool`, try running this code:
Filename: src/main.rs
```rust,ignore
fn main() {
let number = 3;
if number {
println!("number was three");
}
}
```
The `if` condition evaluates to a value of `3` this time, and Rust will
complain about it:
```bash
Compiling branches v0.1.0 (file:///projects/branches)
error: mismatched types [--explain E0308]
--> src/main.rs:4:8
4 |> if number {
|> ^^^^^^ expected bool, found integral variable
note: expected type `bool`
note: found type `_`
error: aborting due to previous error
Could not compile `branches`.
```
The error tells us that Rust expected a `bool`, but got an integer. Rust will
not automatically try to convert non-boolean types to a boolean here, unlike
languages like Ruby or JavaScript. We must be explicit and always give `if` a
`boolean` as its condition. If you wanted your `if` code block to run only when
a number is not equal to `0`, for example, we would change the `if` expression
to read:
Filename: src/main.rs
```rust
fn main() {
let number = 3;
if number != 0 {
println!("number was something other than zero");
}
}
```
Running this will print `number was something other than zero`.
#### Multiple Conditions with `else if`
We can have multiple conditions by combining `if` and `else` in an `else if`
expression. For example:
Filename: src/main.rs
```rust
fn main() {
let number = 6;
if number % 4 == 0 {
println!("number is divisible by 4");
} else if number % 3 == 0 {
println!("number is divisible by 3");
} else if number % 2 == 0 {
println!("number is divisible by 2");
} else {
println!("number is not divisible by 4, 3, or 2");
}
}
```
This program has four possible paths it can take. If you try running it, you
should see output like this:
```bash
$ cargo run
Compiling branches v0.1.0 (file:///projects/branches)
Running `target/debug/branches`
number is divisible by 3
```
When this program executes, it will check each `if` expression in turn and
execute the first body for which the condition holds true. Note that even
though 6 is divisible by 2, we did not see the output `number is divisible by
2`, nor did we see the `number is not divisible by 4, 3, or 2` text from the
`else` block. Thats because Rust will only execute the block for the first
true condition, and once it finds one, it wont even check the rest.
Using too many `else if` expressions can clutter your code, so if you find
yourself with more than one, you may want to look at refactoring your code. In
Chapter 6, well talk about a powerful Rust branching construct called `match`
for these cases.
#### Using `if` in a `let` statement
The last detail you need to know about `if` is that its an expression. That
means that we can use it on the right hand side of a `let` statement, for
instance:
Filename: src/main.rs
```rust
fn main() {
let condition = true;
let number = if condition {
5
} else {
6
};
println!("The value of number is: {}", number);
}
```
The `number` variable will be bound to a value based on the outcome of the `if`
expression. Lets run this to see what happens:
```bash
$ cargo run
Compiling branches v0.1.0 (file:///projects/branches)
Running `target/debug/branches`
The value of number is: 5
```
Remember, blocks of code evaluate to the last expression in them, and numbers
by themselves are also expressions. In this case, the value of the whole `if`
expression depends on which block of code executes. This means the value that
results from both arms of the `if` must be the same type; in the previous
example, they were both `i32` integers. But what happens if the types are
mismatched, as in the following example?
Filename: src/main.rs
```rust,ignore
fn main() {
let condition = true;
let number = if condition {
5
} else {
"six"
};
println!("The value of number is: {}", number);
}
```
If we try to run this, well get an error. The `if` and `else` arms have value
types that are incompatible, and Rust tells us exactly where to find the
problem in our program:
```bash
Compiling branches v0.1.0 (file:///projects/branches)
error[E0308]: if and else have incompatible types
--> src/main.rs:4:18
|
4 | let number = if condition {
| ^ expected integral variable, found reference
|
= note: expected type `{integer}`
= note: found type `&'static str`
error: aborting due to previous error
error: Could not compile `branches`.
```
The expression in the `if` block evaluates to an integer and the expression in
the `else` block evaluates to a string. This cant work, because variables
must have a single type. Rust needs to know at compile time what type
the `number` variable is, definitively, so that it can verify at compile time
that its type is valid everywhere we use `number`. Rust wouldnt be able to do
that if the type of `number` was only determined at runtime; the compiler would
be more complex and be able to make fewer guarantees about our code if it had
to keep track of multiple hypothetical types for any variable.
### Repetition with Loops
Its often useful to be able to execute a block of code more than one time. For
this, Rust has several *loops*. A loop runs through the code inside the loop
body to the end and then starts immediately back at the beginning. To try out
loops, lets make a new project called `loops`.
There are three kinds of loops in Rust: `loop`, `while`, and `for`. Lets dig
in.
#### Repeating Code with `loop`
The `loop` keyword tells Rust to execute a block of code over and over again
forever or until we explicitly tell it to stop.
For an example, change the `src/main.rs` file in your *loops* directory to look
like this:
Filename: src/main.rs
```rust,ignore
fn main() {
loop {
println!("again!");
}
}
```
If we run this program, well see `again!` printed over and over continuously
until we stop the program manually. Most terminals support a keyboard shortcut,
`control-c`, to halt a program stuck in a continual loop. Give it a try:
```bash
$ cargo run
Compiling loops v0.1.0 (file:///projects/loops)
Running `target/debug/loops`
again!
again!
again!
again!
^Cagain!
```
That `^C` there is where we hit `control-c`. You may or may not see `again!`
printed after the `^C`, depending on where the code was in the loop when it
received the signal to halt.
Fortunately, Rust provides another, more reliable way to break out of a loop.
We can place the `break` keyword within the loop to tell the program when to
stop executing the loop. Recall that we did this in the guessing game in
Chapter 2 to exit the program when the user won the game by guessing the number
correctly.
#### Conditional Loops With `while`
Its often useful for a program to have a condition that can be evaluated
within a loop. While the condition is true, the loop runs. When the condition
ceases to be true, we call `break`, stopping the loop. This could be
implemented with a combination of `loop`, `if`, `else`, and `break`; you could
try that now in a program, if youd like.
But this pattern is so common that Rust has a more efficient language construct
for it, called a `while` loop. Heres an example using `while`: this program
loops three times, counting down each time. Finally, after the loop, it prints
another message, then exits:
Filename: src/main.rs
```rust
fn main() {
let mut number = 3;
while number != 0 {
println!("{}!", number);
number = number - 1;
}
println!("LIFTOFF!!!");
}
```
This gets rid of a lot of nesting that would be necessary if we used `loop`,
`if`, `else`, and `break`, and its more clear. While a condition holds, run
this code; otherwise, exit the loop.
#### Looping Through a Collection with `for`
We could use this `while` construct to loop over the elements of a collection,
like an array. For example:
Filename: src/main.rs
```rust
fn main() {
let a = [10, 20, 30, 40, 50];
let mut index = 0;
while index < 5 {
println!("the value is: {}", a[index]);
index = index + 1;
}
}
```
Here, were counting up through the elements in the array. We start at index 0,
then loop until we hit the final index of our array (that is, when `index < 5`
is no longer true). Running this will print out every element of the array:
```bash
$ cargo run
Compiling loops v0.1.0 (file:///projects/loops)
Running `target/debug/loops`
the value is: 10
the value is: 20
the value is: 30
the value is: 40
the value is: 50
```
All five array values appear in the terminal, as expected. Even though `index`
will reach a value of `6` at some point, the loop stops executing before trying
to fetch a sixth value from the array.
This approach is error-prone, though; we could cause our program to panic by
getting the index length incorrect. Its also slow, as the compiler needs to
perform the conditional check on every element on every iteration through the
loop.
As a more efficient alternative, we can use a `for` loop and execute some code
for each item in a collection. A `for` loop looks like this:
Filename: src/main.rs
```rust
fn main() {
let a = [10, 20, 30, 40, 50];
for element in a.iter() {
println!("the value is: {}", element);
}
}
```
If we run this, well see the same output as the previous example. Importantly,
though, weve now increased the safety of our code and eliminated the chance of
bugs that might result from going beyond the end of the array or not going far
enough and missing some items.
For example, in the previous code that uses the `while` loop, if we removed an
item from the `a` array but forgot to update the condition to `while index <
4`, our code would panic. Using the `for` loop means we would not need to
remember to change any other code if we changed the number of values in the
array.
If youre wondering about the `iter` code in this example, keep reading! We
will cover method syntax generally in Chapter XX and iterators specifically in
Chapter XX.
The safety and conciseness of `for` loops make them the most commonly used loop
construct in Rust. Even in situations where you want to run some code a certain
number of times, like our countdown example that used a `while` loop, most
Rustaceans would use a `for` loop. The way to do that would be to use a
`Range`, which is a type provided by the standard library that generates all
numbers in sequence starting from one number and ending before another number.
Heres what the countdown would look like with a for loop, and using another
method we havent yet talked about, `rev`, to reverse the range:
Filename: src/main.rs
```rust
fn main() {
for number in (1..4).rev() {
println!("{}!", number);
}
println!("LIFTOFF!!!");
}
```
Thats a bit nicer, isnt it?
## Summary
You made it! That was a big chapter: we covered variables, scalar and
compound data types, functions, comments, `if` expressions, and loops! If youd
like to get some practice with the concepts in this chapter, try building
programs to do things like:
* Converting temperatures between Fahrenheit and Celsius.
* Generate the nth Fibonacci number.
* Print the lyrics to the Christmas carol *The Twelve Days of Christmas*,
taking advantage of the repetition in the song.
When youre ready to move on, well talk about a concept in Rust that *doesnt*
commonly exist in other programming languages: ownership.