Rust by Example Light
Rust by Example Light
Table of Contents
Rust By Example
Rust by Example
Hello World
Activity
Comments
See also:
Formatted print
Activities
See also:
Debug
See also:
Display
Activity
See also:
Testcase: List
Activity
See also:
Formatting
Activity
See also:
Primitives
Scalar Types
Compound Types
See also:
Literals and operators
Tuples
Activity
Arrays and Slices
Custom Types
Structures
Activity
See also
Enums
Type aliases
See also:
use
See also:
C-like
See also:
Testcase: linked-list
See also:
constants
See also:
Variable Bindings
Mutability
Scope and Shadowing
Declare first 2
Freezing
Types
Casting
Literals
Inference
Aliasing
See also:
Conversion
From and Into
From
Into
From and Into are interchangable
TryFrom and TryInto
To and from Strings
Converting to String
Parsing a String
Expressions
Flow of Control
if/else
loop
Nesting and labels
Returning from loops
while
for loops
for and range
for and iterators
See also:
match
Destructuring
tuples
See also:
arrays/slices
See also:
enums
See also:
pointers/ref
See also:
structs
See also:
Guards
See also:
Binding
See also:
if let
See also:
let-else
See also:
while let
See also:
Functions
Associated functions & Methods 3
Closures
Capturing
See also:
As input parameters
See also:
Type anonymity
See also:
Input functions
See also:
As output parameters
See also:
Examples in std
Iterator::any
See also:
Searching through iterators
See also:
Higher Order Functions
Diverging functions
Modules
Visibility
Struct visibility
See also:
The use declaration
super and self
File hierarchy
Crates
Creating a Library
Using a Library
Cargo
Dependencies
Conventions
Testing
Build Scripts
How to use a build script
Attributes
dead_code
Crates
cfg
See also:
Custom
Generics
See also:
Functions
See also:
Implementation
See also:
Traits
See also:
Bounds
See also:
Testcase: empty bounds 4
See also:
Multiple bounds
See also:
Where clauses
See also:
New Type Idiom
See also:
Associated items
See also:
The Problem
See also:
Associated types
Phantom type parameters
See also:
Testcase: unit clarification
See also:
Scoping rules
RAII
Destructor
See also:
Ownership and moves
Mutability
Partial moves
See also:
Borrowing
Mutability
See also:
Aliasing
The ref pattern
Lifetimes
Explicit annotation
See also:
Functions
See also:
Methods
See also:
Structs
See also:
Traits
See also:
Bounds
See also:
Coercion
Static
Reference lifetime
Trait bound
See also:
Elision
See also:
Traits
Derive 5
See also:
Returning Traits with dyn
Operator Overloading
See Also
Drop
Iterators
impl Trait
As an argument type
As a return type
Clone
Supertraits
See also:
Disambiguating overlapping traits
See also:
macro_rules!
Syntax
Designators
Overload
Repeat
DRY (Don't Repeat Yourself)
Domain Specific Languages (DSLs)
Variadic Interfaces
Error handling
panic
abort and unwind
Option & unwrap
Unpacking options with ?
Combinators: map
See also:
Combinators: and_then
See also:
Unpacking options and defaults
or() is chainable, evaluates eagerly, keeps empty value intact
or_else() is chainable, evaluates lazily, keeps empty value intact
get_or_insert() evaluates eagerly, modifies empty value in place
get_or_insert_with() evaluates lazily, modifies empty value in place
See also:
Result
Using Result in main
map for Result
aliases for Result
See also:
Early returns
Introducing ?
The try! macro
Multiple error types
Pulling Results out of Options
Defining an error type
Boxing errors
See also:
Other uses of ? 6
See also:
Wrapping errors
See also:
Iterating over Results
Ignore the failed items with filter_map()
Collect the failed items with map_err() and filter_map()
Fail the entire operation with collect()
Collect all valid values and failures with partition()
Std library types
See also:
Box, stack and heap
Vectors
Strings
Literals and escapes
Option
Result
?
panic!
HashMap
Alternate/custom key types
HashSet
Rc
See also:
Arc
Std misc
See also:
Threads
Testcase: map-reduce
Assignments
See also:
Channels
Path
See also:
File I/O
open
create
read_lines
A naive approach
A more efficient approach
Child processes
Pipes
Wait
Filesystem Operations
See also:
Program arguments
Standard Library
Crates
Argument parsing
Foreign Function Interface
Testing
See Also 7
Unit testing
Tests and ?
Testing panics
Running specific tests
Ignoring tests
Documentation testing
Motivation behind documentation tests
See Also
Integration testing
Development dependencies
See Also
Unsafe Operations
Raw Pointers
Calling Unsafe Functions
Inline assembly
Basic usage
Inputs and outputs
Late output operands
Explicit register operands
Clobbered registers
Symbol operands and ABI clobbers
Register template modifiers
Memory address operands
Labels
Options
Compatibility
Raw identifiers
Meta
Documentation
Doc comments
Doc attributes
inline
no_inline
hidden
See also:
Playground
Using it with mdbook
Using it with docs
See also:
8
Rust by Example
Rust is a modern systems programming language focusing on safety, speed, and concurrency. It
accomplishes these goals by being memory safe without using garbage collection.
Rust by Example (RBE) is a collection of runnable examples that illustrate various Rust concepts and
standard libraries. To get even more out of these examples, don't forget to install Rust locally and check
out the official docs. Additionally for the curious, you can also check out the source code for this site.
Primitives - Learn about signed integers, unsigned integers and other primitives.
Conversion - Convert between different types, such as strings, integers, and floats.
Cargo - Go through some basic features of the official Rust package management tool.
Generics - Learn about writing a function or data type which can work for multiple types of
arguments.
Scoping rules - Scopes play an important part in ownership, borrowing, and lifetimes.
Macros - Macros are a way of writing code that writes other code, which is known as
metaprogramming.
Std library types - Learn about some custom types provided by std library.
Hello World
This is the source code of the traditional Hello World program.
$ rustc hello.rs
$ ./hello
Hello World!
Activity
Click 'Run' above to see the expected output. Next, add a new line with a second println! macro so that
the output shows:
Hello World!
I'm a Rustacean!
12
13
Comments
Any program requires comments, and Rust supports a few different varieties:
1 fn main() {
2 // This is an example of a line comment.
3 // There are two slashes at the beginning of the line.
4 // And nothing written after these will be read by the compiler.
5
6 // println!("Hello, world!");
7
8 // Run it. See? Now try deleting the two slashes, and run it again.
9
10 /*
11 * This is another type of comment, a block comment. In general,
12 * line comments are the recommended comment style. But block comments
13 * are extremely useful for temporarily disabling chunks of code.
14 * /* Block comments can be /* nested, */ */ so it takes only a few
15 * keystrokes to comment out everything in this main() function.
16 * /*/*/* Try it yourself! */*/*/
17 */
18
19 /*
20 Note: The previous column of `*` was entirely for style. There's
21 no actual need for it.
22 */
23
24 // You can manipulate expressions more easily with block comments
25 // than with line comments. Try deleting the comment delimiters
26 // to change the result:
27 let x = 5 + /* 90 + */ 5;
28 println!("Is `x` 10 or 100? x = {}", x);
29 }
See also:
Library documentation
14
15
Formatted print
Printing is handled by a series of macros defined in std::fmt some of which are:
All parse text in the same fashion. As a plus, Rust checks formatting correctness at compile time.
1 fn main() { 16
2 // In general, the `{}` will be automatically replaced with any
3 // arguments. These will be stringified.
4 println!("{} days", 31);
5
6 // Positional arguments can be used. Specifying an integer inside `{}`
7 // determines which additional argument will be replaced. Arguments start
8 // at 0 immediately after the format string.
9 println!("{0}, this is {1}. {1}, this is {0}", "Alice", "Bob");
10
11 // As can named arguments.
12 println!("{subject} {verb} {object}",
13 object="the lazy dog",
14 subject="the quick brown fox",
15 verb="jumps over");
16
17 // Different formatting can be invoked by specifying the format character
18 // after a `:`.
19 println!("Base 10: {}", 69420); // 69420
20 println!("Base 2 (binary): {:b}", 69420); // 10000111100101100
21 println!("Base 8 (octal): {:o}", 69420); // 207454
22 println!("Base 16 (hexadecimal): {:x}", 69420); // 10f2c
23
24 // You can right-justify text with a specified width. This will
25 // output " 1". (Four white spaces and a "1", for a total width of 5.)
26 println!("{number:>5}", number=1);
27
28 // You can pad numbers with extra zeroes,
29 println!("{number:0>5}", number=1); // 00001
30 // and left-adjust by flipping the sign. This will output "10000".
31 println!("{number:0<5}", number=1); // 10000
32
33 // You can use named arguments in the format specifier by appending a `$`.
34 println!("{number:0>width$}", number=1, width=5);
35
36 // Rust even checks to make sure the correct number of arguments are used.
37 println!("My name is {0}, {1} {0}", "Bond");
38 // FIXME ^ Add the missing argument: "James"
39
40 // Only types that implement fmt::Display can be formatted with `{}`. User-
41 // defined types do not implement fmt::Display by default.
42
43 #[allow(dead_code)] // disable `dead_code` which warn against unused module
44 struct Structure(i32);
45
46 // This will not compile because `Structure` does not implement
47 // fmt::Display.
48 // println!("This struct `{}` won't print...", Structure(3));
49 // TODO ^ Try uncommenting this line
50
51 // For Rust 1.58 and above, you can directly capture the argument from a
52 // surrounding variable. Just like the above, this will output
53 // " 1", 4 white spaces and a "1".
54 let number: f64 = 1.0;
55 let width: usize = 5;
56 println!("{number:>width$}");
57 }
std::fmt contains many traits which govern the display of text. The base form of two important ones
are listed below:
fmt::Debug : Uses the {:?} marker. Format text for debugging purposes. 17
fmt::Display : Uses the {} marker. Format text in a more elegant, user friendly fashion.
Here, we used fmt::Display because the std library provides implementations for these types. To print
text for custom types, more steps are required.
Implementing the fmt::Display trait automatically implements the ToString trait which allows us to
convert the type to String .
In line 43, #[allow(dead_code)] is an attribute which only applies to the module after it.
Activities
Fix the issue in the above code (see FIXME) so that it runs without error.
Try uncommenting the line that attempts to format the Structure struct (see TODO)
Add a println! macro call that prints: Pi is roughly 3.142 by controlling the number of decimal
places shown. For the purposes of this exercise, use let pi = 3.141592 as an estimate for pi. (Hint:
you may need to check the std::fmt documentation for setting the number of decimals to display)
See also:
Debug
All types which want to use std::fmt formatting traits require an implementation to be printable.
Automatic implementations are only provided for types such as in the std library. All others must be
manually implemented somehow.
The fmt::Debug trait makes this very straightforward. All types can derive (automatically create) the
fmt::Debug implementation. This is not true for fmt::Display which must be manually implemented.
All std library types are automatically printable with {:?} too:
So fmt::Debug definitely makes this printable but sacrifices some elegance. Rust also provides "pretty
printing" with {:#?} .
1 #[derive(Debug)] 20
2 struct Person<'a> {
3 name: &'a str,
4 age: u8
5 }
6
7 fn main() {
8 let name = "Peter";
9 let age = 27;
10 let peter = Person { name, age };
11
12 // Pretty print
13 println!("{:#?}", peter);
14 }
See also:
Display
fmt::Debug hardly looks compact and clean, so it is often advantageous to customize the output
appearance. This is done by manually implementing fmt::Display , which uses the {} print marker.
Implementing it looks like this:
fmt::Display may be cleaner than fmt::Debug but this presents a problem for the std library. How
should ambiguous types be displayed? For example, if the std library implemented a single style for all
Vec<T> , what style should it be? Would it be either of these two?
No, because there is no ideal style for all types and the std library doesn't presume to dictate one.
fmt::Display is not implemented for Vec<T> or for any other generic containers. fmt::Debug must then
be used for these generic cases.
This is not a problem though because for any new container type which is not generic, fmt::Display can
be implemented.
1 use std::fmt; // Import `fmt` 23
2
3 // A structure holding two numbers. `Debug` will be derived so the results can
4 // be contrasted with `Display`.
5 #[derive(Debug)]
6 struct MinMax(i64, i64);
7
8 // Implement `Display` for `MinMax`.
9 impl fmt::Display for MinMax {
10 fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
11 // Use `self.number` to refer to each positional data point.
12 write!(f, "({}, {})", self.0, self.1)
13 }
14 }
15
16 // Define a structure where the fields are nameable for comparison.
17 #[derive(Debug)]
18 struct Point2D {
19 x: f64,
20 y: f64,
21 }
22
23 // Similarly, implement `Display` for `Point2D`.
24 impl fmt::Display for Point2D {
25 fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
26 // Customize so only `x` and `y` are denoted.
27 write!(f, "x: {}, y: {}", self.x, self.y)
28 }
29 }
30
31 fn main() {
32 let minmax = MinMax(0, 14);
33
34 println!("Compare structures:");
35 println!("Display: {}", minmax);
36 println!("Debug: {:?}", minmax);
37
38 let big_range = MinMax(-300, 300);
39 let small_range = MinMax(-3, 3);
40
41 println!("The big range is {big} and the small is {small}",
42 small = small_range,
43 big = big_range);
44
45 let point = Point2D { x: 3.3, y: 7.2 };
46
47 println!("Compare points:");
48 println!("Display: {}", point);
49 println!("Debug: {:?}", point);
50
51 // Error. Both `Debug` and `Display` were implemented, but `{:b}`
52 // requires `fmt::Binary` to be implemented. This will not work.
53 // println!("What does Point2D look like in binary: {:b}?", point);
54 }
So, fmt::Display has been implemented but fmt::Binary has not, and therefore cannot be used.
std::fmt has many such traits and each requires its own implementation. This is detailed further in
std::fmt .
Activity 24
After checking the output of the above example, use the Point2D struct as a guide to add a Complex struct
to the example. When printed in the same way, the output should be:
See also:
Testcase: List
Implementing fmt::Display for a structure where the elements must each be handled sequentially is
tricky. The problem is that each write! generates a fmt::Result . Proper handling of this requires dealing
with all the results. Rust provides the ? operator for exactly this purpose.
Activity
Try changing the program so that the index of each element in the vector is also printed. The new output
should look like this:
[0: 1, 1: 2, 2: 3]
See also: 27
Formatting
We've seen that formatting is specified via a format string:
The same variable ( foo ) can be formatted differently depending on which argument type is used: X vs o
vs unspecified.
This formatting functionality is implemented via traits, and there is one trait for each argument type. The
most common formatting trait is Display , which handles cases where the argument type is left
unspecified: {} for instance.
1 use std::fmt::{self, Formatter, Display}; 30
2
3 struct City {
4 name: &'static str,
5 // Latitude
6 lat: f32,
7 // Longitude
8 lon: f32,
9 }
10
11 impl Display for City {
12 // `f` is a buffer, and this method must write the formatted string into it.
13 fn fmt(&self, f: &mut Formatter) -> fmt::Result {
14 let lat_c = if self.lat >= 0.0 { 'N' } else { 'S' };
15 let lon_c = if self.lon >= 0.0 { 'E' } else { 'W' };
16
17 // `write!` is like `format!`, but it will write the formatted string
18 // into a buffer (the first argument).
19 write!(f, "{}: {:.3}°{} {:.3}°{}",
20 self.name, self.lat.abs(), lat_c, self.lon.abs(), lon_c)
21 }
22 }
23
24 #[derive(Debug)]
25 struct Color {
26 red: u8,
27 green: u8,
28 blue: u8,
29 }
30
31 fn main() {
32 for city in [
33 City { name: "Dublin", lat: 53.347778, lon: -6.259722 },
34 City { name: "Oslo", lat: 59.95, lon: 10.75 },
35 City { name: "Vancouver", lat: 49.25, lon: -123.1 },
36 ] {
37 println!("{}", city);
38 }
39 for color in [
40 Color { red: 128, green: 255, blue: 90 },
41 Color { red: 0, green: 3, blue: 254 },
42 Color { red: 0, green: 0, blue: 0 },
43 ] {
44 // Switch this to use {} once you've added an implementation
45 // for fmt::Display.
46 println!("{:?}", color);
47 }
48 }
You can view a full list of formatting traits and their argument types in the std::fmt documentation.
Activity
Add an implementation of the fmt::Display trait for the Color struct above so that the output displays
as:
RGB (128, 255, 90) 0x80FF5A 31
RGB (0, 3, 254) 0x0003FE
RGB (0, 0, 0) 0x000000
The formula for calculating a color in the RGB color space is: RGB = (R*65536)+(G*256)+B , (when
R is RED, G is GREEN and B is BLUE) . For more see RGB color format & calculation.
You may need to list each color more than once.
You can pad with zeros to a width of 2 with :0>2 .
See also:
std::fmt
32
33
Primitives
Rust provides access to a wide variety of primitives . A sample includes:
Scalar Types
Signed integers: i8 , i16 , i32 , i64 , i128 and isize (pointer size)
Unsigned integers: u8 , u16 , u32 , u64 , u128 and usize (pointer size)
Floating point: f32 , f64
char Unicode scalar values like 'a' , 'α' and '∞' (4 bytes each)
bool either true or false
The unit type () , whose only possible value is an empty tuple: ()
Despite the value of a unit type being a tuple, it is not considered a compound type because it does not
contain multiple values.
Compound Types
Variables can always be type annotated. Numbers may additionally be annotated via a suffix or by default.
Integers default to i32 and floats to f64 . Note that Rust can also infer types from context.
1 fn main() { 34
2 // Variables can be type annotated.
3 let logical: bool = true;
4
5 let a_float: f64 = 1.0; // Regular annotation
6 let an_integer = 5i32; // Suffix annotation
7
8 // Or a default will be used.
9 let default_float = 3.0; // `f64`
10 let default_integer = 7; // `i32`
11
12 // A type can also be inferred from context.
13 let mut inferred_type = 12; // Type i64 is inferred from another line.
14 inferred_type = 4294967296i64;
15
16 // A mutable variable's value can be changed.
17 let mut mutable = 12; // Mutable `i32`
18 mutable = 21;
19
20 // Error! The type of a variable can't be changed.
21 mutable = true;
22
23 // Variables can be overwritten with shadowing.
24 let mutable = true;
25
26 /* Compound types - Array and Tuple */
27
28 // Array signature consists of Type T and length as [T; length].
29 let my_array: [i32; 5] = [1, 2, 3, 4, 5];
30
31 // Tuple is a collection of values of different types
32 // and is constructed using parentheses ().
33 let my_tuple = (5u32, 1u8, true, -5.04f32);
34 }
See also:
Integers can, alternatively, be expressed using hexadecimal, octal or binary notation using these prefixes
respectively: 0x , 0o or 0b .
Underscores can be inserted in numeric literals to improve readability, e.g. 1_000 is the same as 1000 , and
0.000_001 is the same as 0.000001 .
Rust also supports scientific E-notation, e.g. 1e6 , 7.6e-4 . The associated type is f64 .
We need to tell the compiler the type of the literals we use. For now, we'll use the u32 suffix to indicate
that the literal is an unsigned 32-bit integer, and the i32 suffix to indicate that it's a signed 32-bit integer.
The operators available and their precedence in Rust are similar to other C-like languages.
1 fn main() {
2 // Integer addition
3 println!("1 + 2 = {}", 1u32 + 2);
4
5 // Integer subtraction
6 println!("1 - 2 = {}", 1i32 - 2);
7 // TODO ^ Try changing `1i32` to `1u32` to see why the type is important
8
9 // Scientific notation
10 println!("1e4 is {}, -2.5e-3 is {}", 1e4, -2.5e-3);
11
12 // Short-circuiting boolean logic
13 println!("true AND false is {}", true && false);
14 println!("true OR false is {}", true || false);
15 println!("NOT true is {}", !true);
16
17 // Bitwise operations
18 println!("0011 AND 0101 is {:04b}", 0b0011u32 & 0b0101);
19 println!("0011 OR 0101 is {:04b}", 0b0011u32 | 0b0101);
20 println!("0011 XOR 0101 is {:04b}", 0b0011u32 ^ 0b0101);
21 println!("1 << 5 is {}", 1u32 << 5);
22 println!("0x80 >> 2 is 0x{:x}", 0x80u32 >> 2);
23
24 // Use underscores to improve readability!
25 println!("One million is written as {}", 1_000_000u32);
26 }
37
38
Tuples
A tuple is a collection of values of different types. Tuples are constructed using parentheses () , and each
tuple itself is a value with type signature (T1, T2, ...) , where T1 , T2 are the types of its members.
Functions can use tuples to return multiple values, as tuples can hold any number of values.
1 // Tuples can be used as function arguments and as return values. 39
2 fn reverse(pair: (i32, bool)) -> (bool, i32) {
3 // `let` can be used to bind the members of a tuple to variables.
4 let (int_param, bool_param) = pair;
5
6 (bool_param, int_param)
7 }
8
9 // The following struct is for the activity.
10 #[derive(Debug)]
11 struct Matrix(f32, f32, f32, f32);
12
13 fn main() {
14 // A tuple with a bunch of different types.
15 let long_tuple = (1u8, 2u16, 3u32, 4u64,
16 -1i8, -2i16, -3i32, -4i64,
17 0.1f32, 0.2f64,
18 'a', true);
19
20 // Values can be extracted from the tuple using tuple indexing.
21 println!("Long tuple first value: {}", long_tuple.0);
22 println!("Long tuple second value: {}", long_tuple.1);
23
24 // Tuples can be tuple members.
25 let tuple_of_tuples = ((1u8, 2u16, 2u32), (4u64, -1i8), -2i16);
26
27 // Tuples are printable.
28 println!("tuple of tuples: {:?}", tuple_of_tuples);
29
30 // But long Tuples (more than 12 elements) cannot be printed.
31 //let too_long_tuple = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13);
32 //println!("Too long tuple: {:?}", too_long_tuple);
33 // TODO ^ Uncomment the above 2 lines to see the compiler error
34
35 let pair = (1, true);
36 println!("Pair is {:?}", pair);
37
38 println!("The reversed pair is {:?}", reverse(pair));
39
40 // To create one element tuples, the comma is required to tell them apart
41 // from a literal surrounded by parentheses.
42 println!("One element tuple: {:?}", (5u32,));
43 println!("Just an integer: {:?}", (5u32));
44
45 // Tuples can be destructured to create bindings.
46 let tuple = (1, "hello", 4.5, true);
47
48 let (a, b, c, d) = tuple;
49 println!("{:?}, {:?}, {:?}, {:?}", a, b, c, d);
50
51 let matrix = Matrix(1.1, 1.2, 2.1, 2.2);
52 println!("{:?}", matrix);
53 }
Activity
1. Recap: Add the fmt::Display trait to the Matrix struct in the above example, so that if you switch
from printing the debug format {:?} to the display format {} , you see the following output:
( 1.1 1.2 )
40
( 2.1 2.2 )
You may want to refer back to the example for print display.
2. Add a transpose function using the reverse function as a template, which accepts a matrix as an
argument, and returns a matrix in which two elements have been swapped. For example:
println!("Matrix:\n{}", matrix);
println!("Transpose:\n{}", transpose(matrix));
Matrix:
( 1.1 1.2 )
( 2.1 2.2 )
Transpose:
( 1.1 2.1 )
( 1.2 2.2 )
41
42
Slices are similar to arrays, but their length is not known at compile time. Instead, a slice is a two-word
object; the first word is a pointer to the data, the second word is the length of the slice. The word size is the
same as usize, determined by the processor architecture, e.g. 64 bits on an x86-64. Slices can be used to
borrow a section of an array and have the type signature &[T] .
1 use std::mem; 43
2
3 // This function borrows a slice.
4 fn analyze_slice(slice: &[i32]) {
5 println!("First element of the slice: {}", slice[0]);
6 println!("The slice has {} elements", slice.len());
7 }
8
9 fn main() {
10 // Fixed-size array (type signature is superfluous).
11 let xs: [i32; 5] = [1, 2, 3, 4, 5];
12
13 // All elements can be initialized to the same value.
14 let ys: [i32; 500] = [0; 500];
15
16 // Indexing starts at 0.
17 println!("First element of the array: {}", xs[0]);
18 println!("Second element of the array: {}", xs[1]);
19
20 // `len` returns the count of elements in the array.
21 println!("Number of elements in array: {}", xs.len());
22
23 // Arrays are stack allocated.
24 println!("Array occupies {} bytes", mem::size_of_val(&xs));
25
26 // Arrays can be automatically borrowed as slices.
27 println!("Borrow the whole array as a slice.");
28 analyze_slice(&xs);
29
30 // Slices can point to a section of an array.
31 // They are of the form [starting_index..ending_index].
32 // `starting_index` is the first position in the slice.
33 // `ending_index` is one more than the last position in the slice.
34 println!("Borrow a section of the array as a slice.");
35 analyze_slice(&ys[1 .. 4]);
36
37 // Example of empty slice `&[]`:
38 let empty_array: [u32; 0] = [];
39 assert_eq!(&empty_array, &[]);
40 assert_eq!(&empty_array, &[][..]); // Same but more verbose
41
42 // Arrays can be safely accessed using `.get`, which returns an
43 // `Option`. This can be matched as shown below, or used with
44 // `.expect()` if you would like the program to exit with a nice
45 // message instead of happily continue.
46 for i in 0..xs.len() + 1 { // Oops, one element too far!
47 match xs.get(i) {
48 Some(xval) => println!("{}: {}", i, xval),
49 None => println!("Slow down! {} is too far!", i),
50 }
51 }
52
53 // Out of bound indexing on array causes compile time error.
54 //println!("{}", xs[5]);
55 // Out of bound indexing on slice causes runtime error.
56 //println!("{}", xs[..][5]);
57 }
44
45
Custom Types
Rust custom data types are formed mainly through the two keywords:
Constants can also be created via the const and static keywords.
46
47
Structures
There are three types of structures ("structs") that can be created using the struct keyword:
Activity
1. Add a function rect_area which calculates the area of a Rectangle (try using nested destructuring).
2. Add a function square which takes a Point and a f32 as arguments, and returns a Rectangle
with its top left corner on the point, and a width and height corresponding to the f32 .
See also
Enums
The enum keyword allows the creation of a type which may be one of a few different variants. Any variant
which is valid as a struct is also valid in an enum .
If you use a type alias, you can refer to each enum variant via its alias. This might be useful if the enum's
name is too long or too generic, and you want to rename it.
1 enum VeryVerboseEnumOfThingsToDoWithNumbers {
2 Add,
3 Subtract,
4 }
5
6 // Creates a type alias
7 type Operations = VeryVerboseEnumOfThingsToDoWithNumbers;
8
9 fn main() {
10 // We can refer to each variant via its alias, not its long and inconvenient
11 // name.
12 let x = Operations::Add;
13 }
The most common place you'll see this is in impl blocks using the Self alias.
1 enum VeryVerboseEnumOfThingsToDoWithNumbers {
2 Add,
3 Subtract,
4 }
5
6 impl VeryVerboseEnumOfThingsToDoWithNumbers {
7 fn run(&self, x: i32, y: i32) -> i32 {
8 match self {
9 Self::Add => x + y,
10 Self::Subtract => x - y,
11 }
12 }
13 }
To learn more about enums and type aliases, you can read the stabilization report from when this feature
was stabilized into Rust.
See also:
use
The use declaration can be used so manual scoping isn't needed:
See also:
C-like
enum can also be used as C-like enums.
See also:
casting
57
58
Testcase: linked-list
A common way to implement a linked-list is via enums :
1 use crate::List::*; 59
2
3 enum List {
4 // Cons: Tuple struct that wraps an element and a pointer to the next node
5 Cons(u32, Box<List>),
6 // Nil: A node that signifies the end of the linked list
7 Nil,
8 }
9
10 // Methods can be attached to an enum
11 impl List {
12 // Create an empty list
13 fn new() -> List {
14 // `Nil` has type `List`
15 Nil
16 }
17
18 // Consume a list, and return the same list with a new element at its front
19 fn prepend(self, elem: u32) -> List {
20 // `Cons` also has type List
21 Cons(elem, Box::new(self))
22 }
23
24 // Return the length of the list
25 fn len(&self) -> u32 {
26 // `self` has to be matched, because the behavior of this method
27 // depends on the variant of `self`
28 // `self` has type `&List`, and `*self` has type `List`, matching on a
29 // concrete type `T` is preferred over a match on a reference `&T`
30 // after Rust 2018 you can use self here and tail (with no ref) below as well,
31 // rust will infer &s and ref tail.
32 // See https://doc.rust-lang.org/edition-guide/rust-2018/ownership-and-lifetimes
33 match *self {
34 // Can't take ownership of the tail, because `self` is borrowed;
35 // instead take a reference to the tail
36 Cons(_, ref tail) => 1 + tail.len(),
37 // Base Case: An empty list has zero length
38 Nil => 0
39 }
40 }
41
42 // Return representation of the list as a (heap allocated) string
43 fn stringify(&self) -> String {
44 match *self {
45 Cons(head, ref tail) => {
46 // `format!` is similar to `print!`, but returns a heap
47 // allocated string instead of printing to the console
48 format!("{}, {}", head, tail.stringify())
49 },
50 Nil => {
51 format!("Nil")
52 },
53 }
54 }
55 }
56
57 fn main() {
58 // Create an empty linked list
59 let mut list = List::new();
60
61 // Prepend some elements
62 list = list.prepend(1);
63 list = list.prepend(2);
64 list = list.prepend(3); 60
65
66 // Show the final state of the list
67 println!("linked list has length: {}", list.len());
68 println!("{}", list.stringify());
69 }
See also:
constants
Rust has two different types of constants which can be declared in any scope including global. Both require
explicit type annotation:
See also:
Variable Bindings
Rust provides type safety via static typing. Variable bindings can be type annotated when declared.
However, in most cases, the compiler will be able to infer the type of the variable from the context, heavily
reducing the annotation burden.
Values (like literals) can be bound to variables, using the let binding.
1 fn main() {
2 let an_integer = 1u32;
3 let a_boolean = true;
4 let unit = ();
5
6 // copy `an_integer` into `copied_integer`
7 let copied_integer = an_integer;
8
9 println!("An integer: {:?}", copied_integer);
10 println!("A boolean: {:?}", a_boolean);
11 println!("Meet the unit value: {:?}", unit);
12
13 // The compiler warns about unused variable bindings; these warnings can
14 // be silenced by prefixing the variable name with an underscore
15 let _unused_variable = 3u32;
16
17 let noisy_unused_variable = 2u32;
18 // FIXME ^ Prefix with an underscore to suppress the warning
19 // Please note that warnings may not be shown in a browser
20 }
65
66
Mutability
Variable bindings are immutable by default, but this can be overridden using the mut modifier.
1 fn main() {
2 let _immutable_binding = 1;
3 let mut mutable_binding = 1;
4
5 println!("Before mutation: {}", mutable_binding);
6
7 // Ok
8 mutable_binding += 1;
9
10 println!("After mutation: {}", mutable_binding);
11
12 // Error! Cannot assign a new value to an immutable variable
13 _immutable_binding += 1;
14 }
1 fn main() {
2 // This binding lives in the main function
3 let long_lived_binding = 1;
4
5 // This is a block, and has a smaller scope than the main function
6 {
7 // This binding only exists in this block
8 let short_lived_binding = 2;
9
10 println!("inner short: {}", short_lived_binding);
11 }
12 // End of the block
13
14 // Error! `short_lived_binding` doesn't exist in this scope
15 println!("outer short: {}", short_lived_binding);
16 // FIXME ^ Comment out this line
17
18 println!("outer long: {}", long_lived_binding);
19 }
1 fn main() {
2 let shadowed_binding = 1;
3
4 {
5 println!("before being shadowed: {}", shadowed_binding);
6
7 // This binding *shadows* the outer one
8 let shadowed_binding = "abc";
9
10 println!("shadowed in inner block: {}", shadowed_binding);
11 }
12 println!("outside inner block: {}", shadowed_binding);
13
14 // This binding *shadows* the previous binding
15 let shadowed_binding = 2;
16 println!("shadowed in outer block: {}", shadowed_binding);
17 }
69
70
Declare first
It's possible to declare variable bindings first, and initialize them later. However, this form is seldom used,
as it may lead to the use of uninitialized variables.
1 fn main() {
2 // Declare a variable binding
3 let a_binding;
4
5 {
6 let x = 2;
7
8 // Initialize the binding
9 a_binding = x * x;
10 }
11
12 println!("a binding: {}", a_binding);
13
14 let another_binding;
15
16 // Error! Use of uninitialized binding
17 println!("another binding: {}", another_binding);
18 // FIXME ^ Comment out this line
19
20 another_binding = 1;
21
22 println!("another binding: {}", another_binding);
23 }
The compiler forbids use of uninitialized variables, as this would lead to undefined behavior.
71
72
Freezing
When data is bound by the same name immutably, it also freezes. Frozen data can't be modified until the
immutable binding goes out of scope:
1 fn main() {
2 let mut _mutable_integer = 7i32;
3
4 {
5 // Shadowing by immutable `_mutable_integer`
6 let _mutable_integer = _mutable_integer;
7
8 // Error! `_mutable_integer` is frozen in this scope
9 _mutable_integer = 50;
10 // FIXME ^ Comment out this line
11
12 // `_mutable_integer` goes out of scope
13 }
14
15 // Ok! `_mutable_integer` is not frozen in this scope
16 _mutable_integer = 3;
17 }
73
74
Types
Rust provides several mechanisms to change or define the type of primitive and user defined types. The
following sections cover:
Casting
Rust provides no implicit type conversion (coercion) between primitive types. But, explicit type conversion
(casting) can be performed using the as keyword.
Rules for converting between integral types follow C conventions generally, except in cases where C has
undefined behavior. The behavior of all casts between integral types is well defined in Rust.
1 // Suppress all warnings from casts which overflow. 77
2 #![allow(overflowing_literals)]
3
4 fn main() {
5 let decimal = 65.4321_f32;
6
7 // Error! No implicit conversion
8 let integer: u8 = decimal;
9 // FIXME ^ Comment out this line
10
11 // Explicit conversion
12 let integer = decimal as u8;
13 let character = integer as char;
14
15 // Error! There are limitations in conversion rules.
16 // A float cannot be directly converted to a char.
17 let character = decimal as char;
18 // FIXME ^ Comment out this line
19
20 println!("Casting: {} -> {} -> {}", decimal, integer, character);
21
22 // when casting any value to an unsigned type, T,
23 // T::MAX + 1 is added or subtracted until the value
24 // fits into the new type
25
26 // 1000 already fits in a u16
27 println!("1000 as a u16 is: {}", 1000 as u16);
28
29 // 1000 - 256 - 256 - 256 = 232
30 // Under the hood, the first 8 least significant bits (LSB) are kept,
31 // while the rest towards the most significant bit (MSB) get truncated.
32 println!("1000 as a u8 is : {}", 1000 as u8);
33 // -1 + 256 = 255
34 println!(" -1 as a u8 is : {}", (-1i8) as u8);
35
36 // For positive numbers, this is the same as the modulus
37 println!("1000 mod 256 is : {}", 1000 % 256);
38
39 // When casting to a signed type, the (bitwise) result is the same as
40 // first casting to the corresponding unsigned type. If the most significant
41 // bit of that value is 1, then the value is negative.
42
43 // Unless it already fits, of course.
44 println!(" 128 as a i16 is: {}", 128 as i16);
45
46 // In boundary case 128 value in 8-bit two's complement representation is -128
47 println!(" 128 as a i8 is : {}", 128 as i8);
48
49 // repeating the example above
50 // 1000 as u8 -> 232
51 println!("1000 as a u8 is : {}", 1000 as u8);
52 // and the value of 232 in 8-bit two's complement representation is -24
53 println!(" 232 as a i8 is : {}", 232 as i8);
54
55 // Since Rust 1.45, the `as` keyword performs a *saturating cast*
56 // when casting from float to int. If the floating point value exceeds
57 // the upper bound or is less than the lower bound, the returned value
58 // will be equal to the bound crossed.
59
60 // 300.0 as u8 is 255
61 println!(" 300.0 as u8 is : {}", 300.0_f32 as u8);
62 // -100.0 as u8 is 0
63 println!("-100.0 as u8 is : {}", -100.0_f32 as u8);
64 // nan as u8 is 0 78
65 println!(" nan as u8 is : {}", f32::NAN as u8);
66
67 // This behavior incurs a small runtime cost and can be avoided
68 // with unsafe methods, however the results might overflow and
69 // return **unsound values**. Use these methods wisely:
70 unsafe {
71 // 300.0 as u8 is 44
72 println!(" 300.0 as u8 is : {}", 300.0_f32.to_int_unchecked::<u8>());
73 // -100.0 as u8 is 156
74 println!("-100.0 as u8 is : {}", (-100.0_f32).to_int_unchecked::<u8>());
75 // nan as u8 is 0
76 println!(" nan as u8 is : {}", f32::NAN.to_int_unchecked::<u8>());
77 }
78 }
79
80
Literals
Numeric literals can be type annotated by adding the type as a suffix. As an example, to specify that the
literal 42 should have the type i32 , write 42i32 .
The type of unsuffixed numeric literals will depend on how they are used. If no constraint exists, the
compiler will use i32 for integers, and f64 for floating-point numbers.
1 fn main() {
2 // Suffixed literals, their types are known at initialization
3 let x = 1u8;
4 let y = 2u32;
5 let z = 3f32;
6
7 // Unsuffixed literals, their types depend on how they are used
8 let i = 1;
9 let f = 1.0;
10
11 // `size_of_val` returns the size of a variable in bytes
12 println!("size of `x` in bytes: {}", std::mem::size_of_val(&x));
13 println!("size of `y` in bytes: {}", std::mem::size_of_val(&y));
14 println!("size of `z` in bytes: {}", std::mem::size_of_val(&z));
15 println!("size of `i` in bytes: {}", std::mem::size_of_val(&i));
16 println!("size of `f` in bytes: {}", std::mem::size_of_val(&f));
17 }
There are some concepts used in the previous code that haven't been explained yet, here's a brief
explanation for the impatient readers:
std::mem::size_of_val is a function, but called with its full path. Code can be split in logical units
called modules. In this case, the size_of_val function is defined in the mem module, and the mem
module is defined in the std crate. For more details, see modules and crates.
81
82
Inference
The type inference engine is pretty smart. It does more than looking at the type of the value expression
during an initialization. It also looks at how the variable is used afterwards to infer its type. Here's an
advanced example of type inference:
1 fn main() {
2 // Because of the annotation, the compiler knows that `elem` has type u8.
3 let elem = 5u8;
4
5 // Create an empty vector (a growable array).
6 let mut vec = Vec::new();
7 // At this point the compiler doesn't know the exact type of `vec`, it
8 // just knows that it's a vector of something (`Vec<_>`).
9
10 // Insert `elem` in the vector.
11 vec.push(elem);
12 // Aha! Now the compiler knows that `vec` is a vector of `u8`s (`Vec<u8>`)
13 // TODO ^ Try commenting out the `vec.push(elem)` line
14
15 println!("{:?}", vec);
16 }
No type annotation of variables was needed, the compiler is happy and so is the programmer!
83
84
Aliasing
The type statement can be used to give a new name to an existing type. Types must have
UpperCamelCase names, or the compiler will raise a warning. The exception to this rule are the primitive
types: usize , f32 , etc.
The main use of aliases is to reduce boilerplate; for example the io::Result<T> type is an alias for the
Result<T, io::Error> type.
See also:
Attributes
85
86
Conversion
Primitive types can be converted to each other through casting.
Rust addresses conversion between custom types (i.e., struct and enum ) by the use of traits. The generic
conversions will use the From and Into traits. However there are more specific ones for the more
common cases, in particular when converting to and from String s.
87
88
From
The From trait allows for a type to define how to create itself from another type, hence providing a very
simple mechanism for converting between several types. There are numerous implementations of this trait
within the standard library for conversion of primitive and common types.
We can do something similar for defining a conversion for our own type.
1 use std::convert::From;
2
3 #[derive(Debug)]
4 struct Number {
5 value: i32,
6 }
7
8 impl From<i32> for Number {
9 fn from(item: i32) -> Self {
10 Number { value: item }
11 }
12 }
13
14 fn main() {
15 let num = Number::from(30);
16 println!("My number is {:?}", num);
17 }
Into
The Into trait is simply the reciprocal of the From trait. It defines how to convert a type into another type.
Calling into() typically requires us to specify the result type as the compiler is unable to determine this
most of the time.
1 use std::convert::Into; 89
2
3 #[derive(Debug)]
4 struct Number {
5 value: i32,
6 }
7
8 impl Into<Number> for i32 {
9 fn into(self) -> Number {
10 Number { value: self }
11 }
12 }
13
14 fn main() {
15 let int = 5;
16 // Try removing the type annotation
17 let num: Number = int.into();
18 println!("My number is {:?}", num);
19 }
1 use std::convert::From;
2
3 #[derive(Debug)]
4 struct Number {
5 value: i32,
6 }
7
8 // Define `From`
9 impl From<i32> for Number {
10 fn from(item: i32) -> Self {
11 Number { value: item }
12 }
13 }
14
15 fn main() {
16 let int = 5;
17 // use `Into`
18 let num: Number = int.into();
19 println!("My number is {:?}", num);
20 }
90
91
1 use std::convert::TryFrom;
2 use std::convert::TryInto;
3
4 #[derive(Debug, PartialEq)]
5 struct EvenNumber(i32);
6
7 impl TryFrom<i32> for EvenNumber {
8 type Error = ();
9
10 fn try_from(value: i32) -> Result<Self, Self::Error> {
11 if value % 2 == 0 {
12 Ok(EvenNumber(value))
13 } else {
14 Err(())
15 }
16 }
17 }
18
19 fn main() {
20 // TryFrom
21
22 assert_eq!(EvenNumber::try_from(8), Ok(EvenNumber(8)));
23 assert_eq!(EvenNumber::try_from(5), Err(()));
24
25 // TryInto
26
27 let result: Result<EvenNumber, ()> = 8i32.try_into();
28 assert_eq!(result, Ok(EvenNumber(8)));
29 let result: Result<EvenNumber, ()> = 5i32.try_into();
30 assert_eq!(result, Err(()));
31 }
92
93
Converting to String
To convert any type to a String is as simple as implementing the ToString trait for the type. Rather than
doing so directly, you should implement the fmt::Display trait which automagically provides ToString
and also allows printing the type as discussed in the section on print! .
1 use std::fmt;
2
3 struct Circle {
4 radius: i32
5 }
6
7 impl fmt::Display for Circle {
8 fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
9 write!(f, "Circle of radius {}", self.radius)
10 }
11 }
12
13 fn main() {
14 let circle = Circle { radius: 6 };
15 println!("{}", circle.to_string());
16 }
Parsing a String
It's useful to convert strings into many types, but one of the more common string operations is to convert
them from string to number. The idiomatic approach to this is to use the parse function and either to
arrange for type inference or to specify the type to parse using the 'turbofish' syntax. Both alternatives are
shown in the following example.
This will convert the string into the type specified as long as the FromStr trait is implemented for that
type. This is implemented for numerous types within the standard library.
1 fn main() {
2 let parsed: i32 = "5".parse().unwrap();
3 let turbo_parsed = "10".parse::<i32>().unwrap();
4
5 let sum = parsed + turbo_parsed;
6 println!("Sum: {:?}", sum);
7 }
To obtain this functionality on a user defined type simply implement the FromStr trait for that type.
1 use std::num::ParseIntError; 94
2 use std::str::FromStr;
3
4 #[derive(Debug)]
5 struct Circle {
6 radius: i32,
7 }
8
9 impl FromStr for Circle {
10 type Err = ParseIntError;
11 fn from_str(s: &str) -> Result<Self, Self::Err> {
12 match s.trim().parse() {
13 Ok(num) => Ok(Circle{ radius: num }),
14 Err(e) => Err(e),
15 }
16 }
17 }
18
19 fn main() {
20 let radius = " 3 ";
21 let circle: Circle = radius.parse().unwrap();
22 println!("{:?}", circle);
23 }
95
96
Expressions
A Rust program is (mostly) made up of a series of statements:
1 fn main() {
2 // statement
3 // statement
4 // statement
5 }
There are a few kinds of statements in Rust. The most common two are declaring a variable binding, and
using a ; with an expression:
1 fn main() {
2 // variable binding
3 let x = 5;
4
5 // expression;
6 x;
7 x + 1;
8 15;
9 }
Blocks are expressions too, so they can be used as values in assignments. The last expression in the block
will be assigned to the place expression such as a local variable. However, if the last expression of the
block ends with a semicolon, the return value will be () .
1 fn main() {
2 let x = 5u32;
3
4 let y = {
5 let x_squared = x * x;
6 let x_cube = x_squared * x;
7
8 // This expression will be assigned to `y`
9 x_cube + x_squared + x
10 };
11
12 let z = {
13 // The semicolon suppresses this expression and `()` is assigned to `z`
14 2 * x;
15 };
16
17 println!("x is {:?}", x);
18 println!("y is {:?}", y);
19 println!("z is {:?}", z);
20 }
97
98
Flow of Control
An integral part of any programming language are ways to modify control flow: if / else , for , and
others. Let's talk about them in Rust.
99
100
if/else
Branching with if - else is similar to other languages. Unlike many of them, the boolean condition
doesn't need to be surrounded by parentheses, and each condition is followed by a block. if - else
conditionals are expressions, and, all branches must return the same type.
1 fn main() {
2 let n = 5;
3
4 if n < 0 {
5 print!("{} is negative", n);
6 } else if n > 0 {
7 print!("{} is positive", n);
8 } else {
9 print!("{} is zero", n);
10 }
11
12 let big_n =
13 if n < 10 && n > -10 {
14 println!(", and is a small number, increase ten-fold");
15
16 // This expression returns an `i32`.
17 10 * n
18 } else {
19 println!(", and is a big number, halve the number");
20
21 // This expression must return an `i32` as well.
22 n / 2
23 // TODO ^ Try suppressing this expression with a semicolon.
24 };
25 // ^ Don't forget to put a semicolon here! All `let` bindings need it.
26
27 println!("{} -> {}", n, big_n);
28 }
101
102
loop
Rust provides a loop keyword to indicate an infinite loop.
The break statement can be used to exit a loop at anytime, whereas the continue statement can be used
to skip the rest of the iteration and start a new one.
1 fn main() {
2 let mut count = 0u32;
3
4 println!("Let's count until infinity!");
5
6 // Infinite loop
7 loop {
8 count += 1;
9
10 if count == 3 {
11 println!("three");
12
13 // Skip the rest of this iteration
14 continue;
15 }
16
17 println!("{}", count);
18
19 if count == 5 {
20 println!("OK, that's enough");
21
22 // Exit this loop
23 break;
24 }
25 }
26 }
103
104
1 #![allow(unreachable_code, unused_labels)]
2
3 fn main() {
4 'outer: loop {
5 println!("Entered the outer loop");
6
7 'inner: loop {
8 println!("Entered the inner loop");
9
10 // This would break only the inner loop
11 //break;
12
13 // This breaks the outer loop
14 break 'outer;
15 }
16
17 println!("This point will never be reached");
18 }
19
20 println!("Exited the outer loop");
21 }
105
106
1 fn main() {
2 let mut counter = 0;
3
4 let result = loop {
5 counter += 1;
6
7 if counter == 10 {
8 break counter * 2;
9 }
10 };
11
12 assert_eq!(result, 20);
13 }
107
108
while
The while keyword can be used to run a loop while a condition is true.
1 fn main() {
2 // A counter variable
3 let mut n = 1;
4
5 // Loop while `n` is less than 101
6 while n < 101 {
7 if n % 15 == 0 {
8 println!("fizzbuzz");
9 } else if n % 3 == 0 {
10 println!("fizz");
11 } else if n % 5 == 0 {
12 println!("buzz");
13 } else {
14 println!("{}", n);
15 }
16
17 // Increment counter
18 n += 1;
19 }
20 }
109
110
for loops
1 fn main() {
2 // `n` will take the values: 1, 2, ..., 100 in each iteration
3 for n in 1..101 {
4 if n % 15 == 0 {
5 println!("fizzbuzz");
6 } else if n % 3 == 0 {
7 println!("fizz");
8 } else if n % 5 == 0 {
9 println!("buzz");
10 } else {
11 println!("{}", n);
12 }
13 }
14 }
Alternatively, a..=b can be used for a range that is inclusive on both ends. The above can be written as:
1 fn main() {
2 // `n` will take the values: 1, 2, ..., 100 in each iteration
3 for n in 1..=100 {
4 if n % 15 == 0 {
5 println!("fizzbuzz");
6 } else if n % 3 == 0 {
7 println!("fizz");
8 } else if n % 5 == 0 {
9 println!("buzz");
10 } else {
11 println!("{}", n);
12 }
13 }
14 }
into_iter , iter and iter_mut all handle the conversion of a collection into an iterator in different ways,
by providing different views on the data within.
iter - This borrows each element of the collection through each iteration. Thus leaving the 111
collection untouched and available for reuse after the loop.
1 fn main() {
2 let names = vec!["Bob", "Frank", "Ferris"];
3
4 for name in names.iter() {
5 match name {
6 &"Ferris" => println!("There is a rustacean among us!"),
7 // TODO ^ Try deleting the & and matching just "Ferris"
8 _ => println!("Hello {}", name),
9 }
10 }
11
12 println!("names: {:?}", names);
13 }
into_iter - This consumes the collection so that on each iteration the exact data is provided. Once
the collection has been consumed it is no longer available for reuse as it has been 'moved' within the
loop.
1 fn main() {
2 let names = vec!["Bob", "Frank", "Ferris"];
3
4 for name in names.into_iter() {
5 match name {
6 "Ferris" => println!("There is a rustacean among us!"),
7 _ => println!("Hello {}", name),
8 }
9 }
10
11 println!("names: {:?}", names);
12 // FIXME ^ Comment out this line
13 }
iter_mut - This mutably borrows each element of the collection, allowing for the collection to be
modified in place.
1 fn main() {
2 let mut names = vec!["Bob", "Frank", "Ferris"];
3
4 for name in names.iter_mut() {
5 *name = match name {
6 &mut "Ferris" => "There is a rustacean among us!",
7 _ => "Hello",
8 }
9 }
10
11 println!("names: {:?}", names);
12 }
In the above snippets note the type of match branch, that is the key difference in the types of iteration. The
difference in type then of course implies differing actions that are able to be performed.
See also:
Iterator
112
113
match
Rust provides pattern matching via the match keyword, which can be used like a C switch . The first
matching arm is evaluated and all possible values must be covered.
1 fn main() {
2 let number = 13;
3 // TODO ^ Try different values for `number`
4
5 println!("Tell me about {}", number);
6 match number {
7 // Match a single value
8 1 => println!("One!"),
9 // Match several values
10 2 | 3 | 5 | 7 | 11 => println!("This is a prime"),
11 // TODO ^ Try adding 13 to the list of prime values
12 // Match an inclusive range
13 13..=19 => println!("A teen"),
14 // Handle the rest of cases
15 _ => println!("Ain't special"),
16 // TODO ^ Try commenting out this catch-all arm
17 }
18
19 let boolean = true;
20 // Match is an expression too
21 let binary = match boolean {
22 // The arms of a match must cover all the possible values
23 false => 0,
24 true => 1,
25 // TODO ^ Try commenting out one of these arms
26 };
27
28 println!("{} -> {}", boolean, binary);
29 }
114
115
Destructuring
A match block can destructure items in a variety of ways.
Destructuring Tuples
Destructuring Arrays and Slices
Destructuring Enums
Destructuring Pointers
Destructuring Structures
116
117
tuples
Tuples can be destructured in a match as follows:
1 fn main() {
2 let triple = (0, -2, 3);
3 // TODO ^ Try different values for `triple`
4
5 println!("Tell me about {:?}", triple);
6 // Match can be used to destructure a tuple
7 match triple {
8 // Destructure the second and third elements
9 (0, y, z) => println!("First is `0`, `y` is {:?}, and `z` is {:?}", y, z),
10 (1, ..) => println!("First is `1` and the rest doesn't matter"),
11 (.., 2) => println!("last is `2` and the rest doesn't matter"),
12 (3, .., 4) => println!("First is `3`, last is `4`, and the rest doesn't matter
13 // `..` can be used to ignore the rest of the tuple
14 _ => println!("It doesn't matter what they are"),
15 // `_` means don't bind the value to a variable
16 }
17 }
See also:
Tuples
118
119
arrays/slices
Like tuples, arrays and slices can be destructured this way:
1 fn main() {
2 // Try changing the values in the array, or make it a slice!
3 let array = [1, -2, 6];
4
5 match array {
6 // Binds the second and the third elements to the respective variables
7 [0, second, third] =>
8 println!("array[0] = 0, array[1] = {}, array[2] = {}", second, third),
9
10 // Single values can be ignored with _
11 [1, _, third] => println!(
12 "array[0] = 1, array[2] = {} and array[1] was ignored",
13 third
14 ),
15
16 // You can also bind some and ignore the rest
17 [-1, second, ..] => println!(
18 "array[0] = -1, array[1] = {} and all the other ones were ignored",
19 second
20 ),
21 // The code below would not compile
22 // [-1, second] => ...
23
24 // Or store them in another array/slice (the type depends on
25 // that of the value that is being matched against)
26 [3, second, tail @ ..] => println!(
27 "array[0] = 3, array[1] = {} and the other elements were {:?}",
28 second, tail
29 ),
30
31 // Combining these patterns, we can, for example, bind the first and
32 // last values, and store the rest of them in a single array
33 [first, middle @ .., last] => println!(
34 "array[0] = {}, middle = {:?}, array[2] = {}",
35 first, middle, last
36 ),
37 }
38 }
See also:
enums
An enum is destructured similarly:
See also:
pointers/ref
For pointers, a distinction needs to be made between destructuring and dereferencing as they are different
concepts which are used differently from languages like C/C++.
Dereferencing uses *
Destructuring uses & , ref , and ref mut
1 fn main() {
2 // Assign a reference of type `i32`. The `&` signifies there
3 // is a reference being assigned.
4 let reference = &4;
5
6 match reference {
7 // If `reference` is pattern matched against `&val`, it results
8 // in a comparison like:
9 // `&i32`
10 // `&val`
11 // ^ We see that if the matching `&`s are dropped, then the `i32`
12 // should be assigned to `val`.
13 &val => println!("Got a value via destructuring: {:?}", val),
14 }
15
16 // To avoid the `&`, you dereference before matching.
17 match *reference {
18 val => println!("Got a value via dereferencing: {:?}", val),
19 }
20
21 // What if you don't start with a reference? `reference` was a `&`
22 // because the right side was already a reference. This is not
23 // a reference because the right side is not one.
24 let _not_a_reference = 3;
25
26 // Rust provides `ref` for exactly this purpose. It modifies the
27 // assignment so that a reference is created for the element; this
28 // reference is assigned.
29 let ref _is_a_reference = 3;
30
31 // Accordingly, by defining 2 values without references, references
32 // can be retrieved via `ref` and `ref mut`.
33 let value = 5;
34 let mut mut_value = 6;
35
36 // Use `ref` keyword to create a reference.
37 match value {
38 ref r => println!("Got a reference to a value: {:?}", r),
39 }
40
41 // Use `ref mut` similarly.
42 match mut_value {
43 ref mut m => {
44 // Got a reference. Gotta dereference it before we can
45 // add anything to it.
46 *m += 10;
47 println!("We added 10. `mut_value`: {:?}", m);
48 },
49 }
50 }
See also: 124
structs
Similarly, a struct can be destructured as shown:
1 fn main() {
2 struct Foo {
3 x: (u32, u32),
4 y: u32,
5 }
6
7 // Try changing the values in the struct to see what happens
8 let foo = Foo { x: (1, 2), y: 3 };
9
10 match foo {
11 Foo { x: (1, b), y } => println!("First of x is 1, b = {}, y = {} ", b, y),
12
13 // you can destructure structs and rename the variables,
14 // the order is not important
15 Foo { y: 2, x: i } => println!("y is 2, i = {:?}", i),
16
17 // and you can also ignore some variables:
18 Foo { y, .. } => println!("y = {}, we don't care about x", y),
19 // this will give an error: pattern does not mention field `x`
20 //Foo { y } => println!("y = {}", y),
21 }
22
23 let faa = Foo { x: (1, 2), y: 3 };
24
25 // You do not need a match block to destructure structs:
26 let Foo { x : x0, y: y0 } = faa;
27 println!("Outside: x0 = {x0:?}, y0 = {y0}");
28
29 // Destructuring works with nested structs as well:
30 struct Bar {
31 foo: Foo,
32 }
33
34 let bar = Bar { foo: faa };
35 let Bar { foo: Foo { x: nested_x, y: nested_y } } = bar;
36 println!("Nested: nested_x = {nested_x:?}, nested_y = {nested_y:?}");
37 }
See also:
Structs
127
128
Guards
A match guard can be added to filter the arm.
1 #[allow(dead_code)]
2 enum Temperature {
3 Celsius(i32),
4 Fahrenheit(i32),
5 }
6
7 fn main() {
8 let temperature = Temperature::Celsius(35);
9 // ^ TODO try different values for `temperature`
10
11 match temperature {
12 Temperature::Celsius(t) if t > 30 => println!("{}C is above 30 Celsius", t),
13 // The `if condition` part ^ is a guard
14 Temperature::Celsius(t) => println!("{}C is equal to or below 30 Celsius", t),
15
16 Temperature::Fahrenheit(t) if t > 86 => println!("{}F is above 86 Fahrenheit", t
17 Temperature::Fahrenheit(t) => println!("{}F is equal to or below 86 Fahrenheit"
18 }
19 }
Note that the compiler won't take guard conditions into account when checking if all patterns are covered by
the match expression.
1 fn main() {
2 let number: u8 = 4;
3
4 match number {
5 i if i == 0 => println!("Zero"),
6 i if i > 0 => println!("Greater than zero"),
7 // _ => unreachable!("Should never happen."),
8 // TODO ^ uncomment to fix compilation
9 }
10 }
See also:
Tuples Enums
129
130
Binding
Indirectly accessing a variable makes it impossible to branch and use that variable without re-binding.
match provides the @ sigil for binding values to names:
You can also use binding to "destructure" enum variants, such as Option :
See also:
if let
For some use cases, when matching enums, match is awkward. For example:
match optional {
Some(i) => println!("This is a really long string and `{:?}`", i),
_ => {},
// ^ Required because `match` is exhaustive. Doesn't it seem
// like wasted space?
};
if let is cleaner for this use case and in addition allows various failure options to be specified:
1 fn main() {
2 // All have type `Option<i32>`
3 let number = Some(7);
4 let letter: Option<i32> = None;
5 let emoticon: Option<i32> = None;
6
7 // The `if let` construct reads: "if `let` destructures `number` into
8 // `Some(i)`, evaluate the block (`{}`).
9 if let Some(i) = number {
10 println!("Matched {:?}!", i);
11 }
12
13 // If you need to specify a failure, use an else:
14 if let Some(i) = letter {
15 println!("Matched {:?}!", i);
16 } else {
17 // Destructure failed. Change to the failure case.
18 println!("Didn't match a number. Let's go with a letter!");
19 }
20
21 // Provide an altered failing condition.
22 let i_like_letters = false;
23
24 if let Some(i) = emoticon {
25 println!("Matched {:?}!", i);
26 // Destructure failed. Evaluate an `else if` condition to see if the
27 // alternate failure branch should be taken:
28 } else if i_like_letters {
29 println!("Didn't match a number. Let's go with a letter!");
30 } else {
31 // The condition evaluated false. This branch is the default:
32 println!("I don't like letters. Let's go with an emoticon :)!");
33 }
34 }
In the same way, if let can be used to match any enum value:
1 // Our example enum 133
2 enum Foo {
3 Bar,
4 Baz,
5 Qux(u32)
6 }
7
8 fn main() {
9 // Create example variables
10 let a = Foo::Bar;
11 let b = Foo::Baz;
12 let c = Foo::Qux(100);
13
14 // Variable a matches Foo::Bar
15 if let Foo::Bar = a {
16 println!("a is foobar");
17 }
18
19 // Variable b does not match Foo::Bar
20 // So this will print nothing
21 if let Foo::Bar = b {
22 println!("b is foobar");
23 }
24
25 // Variable c matches Foo::Qux which has a value
26 // Similar to Some() in the previous example
27 if let Foo::Qux(value) = c {
28 println!("c is {}", value);
29 }
30
31 // Binding also works with `if let`
32 if let Foo::Qux(value @ 100) = c {
33 println!("c is one hundred");
34 }
35 }
Another benefit is that if let allows us to match non-parameterized enum variants. This is true even in
cases where the enum doesn't implement or derive PartialEq . In such cases if Foo::Bar == a would
fail to compile, because instances of the enum cannot be equated, however if let will continue to work.
Would you like a challenge? Fix the following example to use if let :
See also:
let-else
🛈 you can target specific edition by compiling like this rustc --edition=2021 main.rs
With let - else , a refutable pattern can match and bind variables in the surrounding scope like a normal
let , or else diverge (e.g. break , return , panic! ) when the pattern doesn't match.
use std::str::FromStr;
fn main() {
assert_eq!(get_count_item("3 chairs"), (3, "chairs"));
}
The scope of name bindings is the main thing that makes this different from match or if let - else
expressions. You could previously approximate these patterns with an unfortunate bit of repetition and an
outer let :
See also:
while let
Similar to if let , while let can make awkward match sequences more tolerable. Consider the
following sequence that increments i :
1 fn main() {
2 // Make `optional` of type `Option<i32>`
3 let mut optional = Some(0);
4
5 // This reads: "while `let` destructures `optional` into
6 // `Some(i)`, evaluate the block (`{}`). Else `break`.
7 while let Some(i) = optional {
8 if i > 9 {
9 println!("Greater than 9, quit!");
10 optional = None;
11 } else {
12 println!("`i` is `{:?}`. Try again.", i);
13 optional = Some(i + 1);
14 }
15 // ^ Less rightward drift and doesn't require
16 // explicitly handling the failing case.
17 }
18 // ^ `if let` had additional optional `else`/`else if`
19 // clauses. `while let` does not have these.
20 }
See also:
Functions
Functions are declared using the fn keyword. Its arguments are type annotated, just like variables, and, if
the function returns a value, the return type must be specified after an arrow -> .
The final expression in the function will be used as return value. Alternatively, the return statement can
be used to return a value earlier from within the function, even from inside loops or if statements.
Closures
Closures are functions that can capture the enclosing environment. For example, a closure that captures the
x variable:
|val| val + x
The syntax and capabilities of closures make them very convenient for on the fly usage. Calling a closure is
exactly like calling a function. However, both input and return types can be inferred and input variable
names must be specified.
1 fn main() {
2 let outer_var = 42;
3
4 // A regular function can't refer to variables in the enclosing environment
5 //fn function(i: i32) -> i32 { i + outer_var }
6 // TODO: uncomment the line above and see the compiler error. The compiler
7 // suggests that we define a closure instead.
8
9 // Closures are anonymous, here we are binding them to references.
10 // Annotation is identical to function annotation but is optional
11 // as are the `{}` wrapping the body. These nameless functions
12 // are assigned to appropriately named variables.
13 let closure_annotated = |i: i32| -> i32 { i + outer_var };
14 let closure_inferred = |i | i + outer_var ;
15
16 // Call the closures.
17 println!("closure_annotated: {}", closure_annotated(1));
18 println!("closure_inferred: {}", closure_inferred(1));
19 // Once closure's type has been inferred, it cannot be inferred again with another t
20 //println!("cannot reuse closure_inferred with another type: {}", closure_inferred(4
21 // TODO: uncomment the line above and see the compiler error.
22
23 // A closure taking no arguments which returns an `i32`.
24 // The return type is inferred.
25 let one = || 1;
26 println!("closure returning one: {}", one());
27
28 }
146
147
Capturing
Closures are inherently flexible and will do what the functionality requires to make the closure work
without annotation. This allows capturing to flexibly adapt to the use case, sometimes moving and
sometimes borrowing. Closures can capture variables:
by reference: &T
by mutable reference: &mut T
by value: T
They preferentially capture variables by reference and only go lower when required.
1 fn main() { 148
2 use std::mem;
3
4 let color = String::from("green");
5
6 // A closure to print `color` which immediately borrows (`&`) `color` and
7 // stores the borrow and closure in the `print` variable. It will remain
8 // borrowed until `print` is used the last time.
9 //
10 // `println!` only requires arguments by immutable reference so it doesn't
11 // impose anything more restrictive.
12 let print = || println!("`color`: {}", color);
13
14 // Call the closure using the borrow.
15 print();
16
17 // `color` can be borrowed immutably again, because the closure only holds
18 // an immutable reference to `color`.
19 let _reborrow = &color;
20 print();
21
22 // A move or reborrow is allowed after the final use of `print`
23 let _color_moved = color;
24
25
26 let mut count = 0;
27 // A closure to increment `count` could take either `&mut count` or `count`
28 // but `&mut count` is less restrictive so it takes that. Immediately
29 // borrows `count`.
30 //
31 // A `mut` is required on `inc` because a `&mut` is stored inside. Thus,
32 // calling the closure mutates `count` which requires a `mut`.
33 let mut inc = || {
34 count += 1;
35 println!("`count`: {}", count);
36 };
37
38 // Call the closure using a mutable borrow.
39 inc();
40
41 // The closure still mutably borrows `count` because it is called later.
42 // An attempt to reborrow will lead to an error.
43 // let _reborrow = &count;
44 // ^ TODO: try uncommenting this line.
45 inc();
46
47 // The closure no longer needs to borrow `&mut count`. Therefore, it is
48 // possible to reborrow without an error
49 let _count_reborrowed = &mut count;
50
51
52 // A non-copy type.
53 let movable = Box::new(3);
54
55 // `mem::drop` requires `T` so this must take by value. A copy type
56 // would copy into the closure leaving the original untouched.
57 // A non-copy must move and so `movable` immediately moves into
58 // the closure.
59 let consume = || {
60 println!("`movable`: {:?}", movable);
61 mem::drop(movable);
62 };
63
64 // `consume` consumes the variable so this can only be called once. 149
65 consume();
66 // consume();
67 // ^ TODO: Try uncommenting this line.
68 }
Using move before vertical pipes forces closure to take ownership of captured variables:
1 fn main() {
2 // `Vec` has non-copy semantics.
3 let haystack = vec![1, 2, 3];
4
5 let contains = move |needle| haystack.contains(needle);
6
7 println!("{}", contains(&1));
8 println!("{}", contains(&4));
9
10 // println!("There're {} elements in vec", haystack.len());
11 // ^ Uncommenting above line will result in compile-time error
12 // because borrow checker doesn't allow re-using variable after it
13 // has been moved.
14
15 // Removing `move` from closure's signature will cause closure
16 // to borrow _haystack_ variable immutably, hence _haystack_ is still
17 // available and uncommenting above line will not cause an error.
18 }
See also:
As input parameters
While Rust chooses how to capture variables on the fly mostly without type annotation, this ambiguity is
not allowed when writing functions. When taking a closure as an input parameter, the closure's complete
type must be annotated using one of a few traits , and they're determined by what the closure does with
captured value. In order of decreasing restriction, they are:
On a variable-by-variable basis, the compiler will capture variables in the least restrictive manner possible.
For instance, consider a parameter annotated as FnOnce . This specifies that the closure may capture by
&T , &mut T , or T , but the compiler will ultimately choose based on how the captured variables are used
in the closure.
This is because if a move is possible, then any type of borrow should also be possible. Note that the
reverse is not true. If the parameter is annotated as Fn , then capturing variables by &mut T or T are not
allowed. However, &T is allowed.
In the following example, try swapping the usage of Fn , FnMut , and FnOnce to see what happens:
1 // A function which takes a closure as an argument and calls it. 152
2 // <F> denotes that F is a "Generic type parameter"
3 fn apply<F>(f: F) where
4 // The closure takes no input and returns nothing.
5 F: FnOnce() {
6 // ^ TODO: Try changing this to `Fn` or `FnMut`.
7
8 f();
9 }
10
11 // A function which takes a closure and returns an `i32`.
12 fn apply_to_3<F>(f: F) -> i32 where
13 // The closure takes an `i32` and returns an `i32`.
14 F: Fn(i32) -> i32 {
15
16 f(3)
17 }
18
19 fn main() {
20 use std::mem;
21
22 let greeting = "hello";
23 // A non-copy type.
24 // `to_owned` creates owned data from borrowed one
25 let mut farewell = "goodbye".to_owned();
26
27 // Capture 2 variables: `greeting` by reference and
28 // `farewell` by value.
29 let diary = || {
30 // `greeting` is by reference: requires `Fn`.
31 println!("I said {}.", greeting);
32
33 // Mutation forces `farewell` to be captured by
34 // mutable reference. Now requires `FnMut`.
35 farewell.push_str("!!!");
36 println!("Then I screamed {}.", farewell);
37 println!("Now I can sleep. zzzzz");
38
39 // Manually calling drop forces `farewell` to
40 // be captured by value. Now requires `FnOnce`.
41 mem::drop(farewell);
42 };
43
44 // Call the function which applies the closure.
45 apply(diary);
46
47 // `double` satisfies `apply_to_3`'s trait bound
48 let double = |x| 2 * x;
49
50 println!("3 doubled: {}", apply_to_3(double));
51 }
See also:
Type anonymity
Closures succinctly capture variables from enclosing scopes. Does this have any consequences? It surely
does. Observe how using a closure as a function parameter requires generics, which is necessary because
of how they are defined:
When a closure is defined, the compiler implicitly creates a new anonymous structure to store the captured
variables inside, meanwhile implementing the functionality via one of the traits : Fn , FnMut , or FnOnce
for this unknown type. This type is assigned to the variable which is stored until calling.
Since this new type is of unknown type, any usage in a function will require generics. However, an
unbounded type parameter <T> would still be ambiguous and not be allowed. Thus, bounding by one of
the traits : Fn , FnMut , or FnOnce (which it implements) is sufficient to specify its type.
See also:
Input functions
Since closures may be used as arguments, you might wonder if the same can be said about functions. And
indeed they can! If you declare a function that takes a closure as parameter, then any function that satisfies
the trait bound of that closure can be passed as a parameter.
As an additional note, the Fn , FnMut , and FnOnce traits dictate how a closure captures variables from
the enclosing scope.
See also:
As output parameters
Closures as input parameters are possible, so returning closures as output parameters should also be
possible. However, anonymous closure types are, by definition, unknown, so we have to use impl Trait
to return them.
Fn
FnMut
FnOnce
Beyond this, the move keyword must be used, which signals that all captures occur by value. This is
required because any captures by reference would be dropped as soon as the function exited, leaving
invalid references in the closure.
See also:
Examples in std
This section contains a few examples of using closures from the std library.
161
162
Iterator::any
Iterator::any is a function which when passed an iterator, will return true if any element satisfies the
predicate. Otherwise false . Its signature:
1 fn main() {
2 let vec1 = vec![1, 2, 3];
3 let vec2 = vec![4, 5, 6];
4
5 // `iter()` for vecs yields `&i32`. Destructure to `i32`.
6 println!("2 in vec1: {}", vec1.iter() .any(|&x| x == 2));
7 // `into_iter()` for vecs yields `i32`. No destructuring required.
8 println!("2 in vec2: {}", vec2.into_iter().any(|x| x == 2));
9
10 // `iter()` only borrows `vec1` and its elements, so they can be used again
11 println!("vec1 len: {}", vec1.len());
12 println!("First element of vec1 is: {}", vec1[0]);
13 // `into_iter()` does move `vec2` and its elements, so they cannot be used again
14 // println!("First element of vec2 is: {}", vec2[0]);
15 // println!("vec2 len: {}", vec2.len());
16 // TODO: uncomment two lines above and see compiler errors.
17
18 let array1 = [1, 2, 3];
19 let array2 = [4, 5, 6];
20
21 // `iter()` for arrays yields `&i32`.
22 println!("2 in array1: {}", array1.iter() .any(|&x| x == 2));
23 // `into_iter()` for arrays yields `i32`.
24 println!("2 in array2: {}", array2.into_iter().any(|x| x == 2));
25 }
See also:
std::iter::Iterator::any
163
164
1 fn main() {
2 let vec1 = vec![1, 2, 3];
3 let vec2 = vec![4, 5, 6];
4
5 // `iter()` for vecs yields `&i32`.
6 let mut iter = vec1.iter();
7 // `into_iter()` for vecs yields `i32`.
8 let mut into_iter = vec2.into_iter();
9
10 // `iter()` for vecs yields `&i32`, and we want to reference one of its
11 // items, so we have to destructure `&&i32` to `i32`
12 println!("Find 2 in vec1: {:?}", iter .find(|&&x| x == 2));
13 // `into_iter()` for vecs yields `i32`, and we want to reference one of
14 // its items, so we have to destructure `&i32` to `i32`
15 println!("Find 2 in vec2: {:?}", into_iter.find(| &x| x == 2));
16
17 let array1 = [1, 2, 3];
18 let array2 = [4, 5, 6];
19
20 // `iter()` for arrays yields `&&i32`
21 println!("Find 2 in array1: {:?}", array1.iter() .find(|&&x| x == 2));
22 // `into_iter()` for arrays yields `&i32`
23 println!("Find 2 in array2: {:?}", array2.into_iter().find(|&x| x == 2));
24 }
Iterator::find gives you a reference to the item. But if you want the index of the item, use
Iterator::position .
1 fn main() { 165
2 let vec = vec![1, 9, 3, 3, 13, 2];
3
4 // `iter()` for vecs yields `&i32` and `position()` does not take a reference, so
5 // we have to destructure `&i32` to `i32`
6 let index_of_first_even_number = vec.iter().position(|&x| x % 2 == 0);
7 assert_eq!(index_of_first_even_number, Some(5));
8
9 // `into_iter()` for vecs yields `i32` and `position()` does not take a reference, s
10 // we do not have to destructure
11 let index_of_first_negative_number = vec.into_iter().position(|x| x < 0);
12 assert_eq!(index_of_first_negative_number, None);
13 }
See also:
std::iter::Iterator::find
std::iter::Iterator::find_map
std::iter::Iterator::position
std::iter::Iterator::rposition
166
167
Diverging functions
Diverging functions never return. They are marked using ! , which is an empty type.
fn foo() -> ! {
panic!("This call never returns.");
}
As opposed to all the other types, this one cannot be instantiated, because the set of all possible values
this type can have is empty. Note that, it is different from the () type, which has exactly one possible
value.
For example, this function returns as usual, although there is no information in the return value.
fn some_fn() {
()
}
fn main() {
let _a: () = some_fn();
println!("This function returns and you can see this line.");
}
As opposed to this function, which will never return the control back to the caller.
#![feature(never_type)]
fn main() {
let x: ! = panic!("This call never returns.");
println!("You will never see this line!");
}
Although this might seem like an abstract concept, it is actually very useful and often handy. The main
advantage of this type is that it can be cast to any other type, making it versatile in situations where an exact
type is required, such as in match branches. This flexibility allows us to write code like this:
fn main() { 170
fn sum_odd_numbers(up_to: u32) -> u32 {
let mut acc = 0;
for i in 0..up_to {
// Notice that the return type of this match expression must be u32
// because of the type of the "addition" variable.
let addition: u32 = match i%2 == 1 {
// The "i" variable is of type u32, which is perfectly fine.
true => i,
// On the other hand, the "continue" expression does not return
// u32, but it is still fine, because it never returns and therefore
// does not violate the type requirements of the match expression.
false => continue,
};
acc += addition;
}
acc
}
println!("Sum of odd numbers up to 9 (excluding): {}", sum_odd_numbers(9));
}
It is also the return type of functions that loop forever (e.g. loop {} ) like network servers or functions that
terminate the process (e.g. exit() ).
171
172
Modules
Rust provides a powerful module system that can be used to hierarchically split code in logical units
(modules), and manage visibility (public/private) between them.
A module is a collection of items: functions, structs, traits, impl blocks, and even other modules.
173
174
Visibility
By default, the items in a module have private visibility, but this can be overridden with the pub modifier.
Only the public items of a module can be accessed from outside the module scope.
1 // A module named `my_mod` 175
2 mod my_mod {
3 // Items in modules default to private visibility.
4 fn private_function() {
5 println!("called `my_mod::private_function()`");
6 }
7
8 // Use the `pub` modifier to override default visibility.
9 pub fn function() {
10 println!("called `my_mod::function()`");
11 }
12
13 // Items can access other items in the same module,
14 // even when private.
15 pub fn indirect_access() {
16 print!("called `my_mod::indirect_access()`, that\n> ");
17 private_function();
18 }
19
20 // Modules can also be nested
21 pub mod nested {
22 pub fn function() {
23 println!("called `my_mod::nested::function()`");
24 }
25
26 #[allow(dead_code)]
27 fn private_function() {
28 println!("called `my_mod::nested::private_function()`");
29 }
30
31 // Functions declared using `pub(in path)` syntax are only visible
32 // within the given path. `path` must be a parent or ancestor module
33 pub(in crate::my_mod) fn public_function_in_my_mod() {
34 print!("called `my_mod::nested::public_function_in_my_mod()`, that\n> ");
35 public_function_in_nested();
36 }
37
38 // Functions declared using `pub(self)` syntax are only visible within
39 // the current module, which is the same as leaving them private
40 pub(self) fn public_function_in_nested() {
41 println!("called `my_mod::nested::public_function_in_nested()`");
42 }
43
44 // Functions declared using `pub(super)` syntax are only visible within
45 // the parent module
46 pub(super) fn public_function_in_super_mod() {
47 println!("called `my_mod::nested::public_function_in_super_mod()`");
48 }
49 }
50
51 pub fn call_public_function_in_my_mod() {
52 print!("called `my_mod::call_public_function_in_my_mod()`, that\n> ");
53 nested::public_function_in_my_mod();
54 print!("> ");
55 nested::public_function_in_super_mod();
56 }
57
58 // pub(crate) makes functions visible only within the current crate
59 pub(crate) fn public_function_in_crate() {
60 println!("called `my_mod::public_function_in_crate()`");
61 }
62
63 // Nested modules follow the same rules for visibility
64 mod private_nested { 176
65 #[allow(dead_code)]
66 pub fn function() {
67 println!("called `my_mod::private_nested::function()`");
68 }
69
70 // Private parent items will still restrict the visibility of a child item,
71 // even if it is declared as visible within a bigger scope.
72 #[allow(dead_code)]
73 pub(crate) fn restricted_function() {
74 println!("called `my_mod::private_nested::restricted_function()`");
75 }
76 }
77 }
78
79 fn function() {
80 println!("called `function()`");
81 }
82
83 fn main() {
84 // Modules allow disambiguation between items that have the same name.
85 function();
86 my_mod::function();
87
88 // Public items, including those inside nested modules, can be
89 // accessed from outside the parent module.
90 my_mod::indirect_access();
91 my_mod::nested::function();
92 my_mod::call_public_function_in_my_mod();
93
94 // pub(crate) items can be called from anywhere in the same crate
95 my_mod::public_function_in_crate();
96
97 // pub(in path) items can only be called from within the module specified
98 // Error! function `public_function_in_my_mod` is private
99 //my_mod::nested::public_function_in_my_mod();
100 // TODO ^ Try uncommenting this line
101
102 // Private items of a module cannot be directly accessed, even if
103 // nested in a public module:
104
105 // Error! `private_function` is private
106 //my_mod::private_function();
107 // TODO ^ Try uncommenting this line
108
109 // Error! `private_function` is private
110 //my_mod::nested::private_function();
111 // TODO ^ Try uncommenting this line
112
113 // Error! `private_nested` is a private module
114 //my_mod::private_nested::function();
115 // TODO ^ Try uncommenting this line
116
117 // Error! `private_nested` is a private module
118 //my_mod::private_nested::restricted_function();
119 // TODO ^ Try uncommenting this line
120 }
177
178
Struct visibility
Structs have an extra level of visibility with their fields. The visibility defaults to private, and can be
overridden with the pub modifier. This visibility only matters when a struct is accessed from outside the
module where it is defined, and has the goal of hiding information (encapsulation).
1 mod my {
2 // A public struct with a public field of generic type `T`
3 pub struct OpenBox<T> {
4 pub contents: T,
5 }
6
7 // A public struct with a private field of generic type `T`
8 pub struct ClosedBox<T> {
9 contents: T,
10 }
11
12 impl<T> ClosedBox<T> {
13 // A public constructor method
14 pub fn new(contents: T) -> ClosedBox<T> {
15 ClosedBox {
16 contents: contents,
17 }
18 }
19 }
20 }
21
22 fn main() {
23 // Public structs with public fields can be constructed as usual
24 let open_box = my::OpenBox { contents: "public information" };
25
26 // and their fields can be normally accessed.
27 println!("The open box contains: {}", open_box.contents);
28
29 // Public structs with private fields cannot be constructed using field names.
30 // Error! `ClosedBox` has private fields
31 //let closed_box = my::ClosedBox { contents: "classified information" };
32 // TODO ^ Try uncommenting this line
33
34 // However, structs with private fields can be created using
35 // public constructors
36 let _closed_box = my::ClosedBox::new("classified information");
37
38 // and the private fields of a public struct cannot be accessed.
39 // Error! The `contents` field is private
40 //println!("The closed box contains: {}", _closed_box.contents);
41 // TODO ^ Try uncommenting this line
42 }
See also:
1 use crate::deeply::nested::{
2 my_first_function,
3 my_second_function,
4 AndATraitType
5 };
6
7 fn main() {
8 my_first_function();
9 }
1 fn function() {
2 println!("called `function()`");
3 }
4
5 mod cool {
6 pub fn function() {
7 println!("called `cool::function()`");
8 }
9 }
10
11 mod my {
12 fn function() {
13 println!("called `my::function()`");
14 }
15
16 mod cool {
17 pub fn function() {
18 println!("called `my::cool::function()`");
19 }
20 }
21
22 pub fn indirect_call() {
23 // Let's access all the functions named `function` from this scope!
24 print!("called `my::indirect_call()`, that\n> ");
25
26 // The `self` keyword refers to the current module scope - in this case `my`.
27 // Calling `self::function()` and calling `function()` directly both give
28 // the same result, because they refer to the same function.
29 self::function();
30 function();
31
32 // We can also use `self` to access another module inside `my`:
33 self::cool::function();
34
35 // The `super` keyword refers to the parent scope (outside the `my` module).
36 super::function();
37
38 // This will bind to the `cool::function` in the *crate* scope.
39 // In this case the crate scope is the outermost scope.
40 {
41 use crate::cool::function as root_function;
42 root_function();
43 }
44 }
45 }
46
47 fn main() {
48 my::indirect_call();
49 }
183
184
File hierarchy
Modules can be mapped to a file/directory hierarchy. Let's break down the visibility example in files:
$ tree .
.
├── my
│ ├── inaccessible.rs
│ └── nested.rs
├── my.rs
└── split.rs
In split.rs :
// This declaration will look for a file named `my.rs` and will
// insert its contents inside a module named `my` under this scope
mod my;
fn function() {
println!("called `function()`");
}
fn main() {
my::function();
function();
my::indirect_access();
my::nested::function();
}
In my.rs :
// Similarly `mod inaccessible` and `mod nested` will locate the `nested.rs`
// and `inaccessible.rs` files and insert them here under their respective
// modules
mod inaccessible;
pub mod nested;
pub fn function() {
println!("called `my::function()`");
}
fn private_function() {
println!("called `my::private_function()`");
}
pub fn indirect_access() {
print!("called `my::indirect_access()`, that\n> ");
private_function();
}
In my/nested.rs :
pub fn function() { 185
println!("called `my::nested::function()`");
}
#[allow(dead_code)]
fn private_function() {
println!("called `my::nested::private_function()`");
}
In my/inaccessible.rs :
#[allow(dead_code)]
pub fn public_function() {
println!("called `my::inaccessible::public_function()`");
}
Crates
A crate is a compilation unit in Rust. Whenever rustc some_file.rs is called, some_file.rs is treated as
the crate file. If some_file.rs has mod declarations in it, then the contents of the module files would be
inserted in places where mod declarations in the crate file are found, before running the compiler over it. In
other words, modules do not get compiled individually, only crates get compiled.
A crate can be compiled into a binary or into a library. By default, rustc will produce a binary from a crate.
This behavior can be overridden by passing the --crate-type flag to lib .
188
189
Creating a Library
Let's create a library, and then see how to link it to another crate.
In rary.rs :
pub fn public_function() {
println!("called rary's `public_function()`");
}
fn private_function() {
println!("called rary's `private_function()`");
}
pub fn indirect_access() {
print!("called rary's `indirect_access()`, that\n> ");
private_function();
}
Libraries get prefixed with "lib", and by default they get named after their crate file, but this default name
can be overridden by passing the --crate-name option to rustc or by using the crate_name attribute.
190
191
Using a Library
To link a crate to this new library you may use rustc 's --extern flag. All of its items will then be
imported under a module named the same as the library. This module generally behaves the same way as
any other module.
// extern crate rary; // May be required for Rust 2015 edition or earlier
fn main() {
rary::public_function();
rary::indirect_access();
}
# Where library.rlib is the path to the compiled library, assumed that it's
# in the same directory here:
$ rustc executable.rs --extern rary=library.rlib && ./executable
called rary's `public_function()`
called rary's `indirect_access()`, that
> called rary's `private_function()`
192
193
Cargo
cargo is the official Rust package management tool. It has lots of really useful features to improve code
quality and developer velocity! These include
Dependency management and integration with crates.io (the official Rust package registry)
Awareness of unit tests
Awareness of benchmarks
This chapter will go through some quick basics, but you can find the comprehensive docs in The Cargo
Book.
194
195
Dependencies
Most programs have dependencies on some libraries. If you have ever managed dependencies by hand, you
know how much of a pain this can be. Luckily, the Rust ecosystem comes standard with cargo ! cargo can
manage dependencies for a project.
# A binary
cargo new foo
# A library
cargo new --lib bar
For the rest of this chapter, let's assume we are making a binary, rather than a library, but all of the concepts
are the same.
After the above commands, you should see a file hierarchy like this:
.
├── bar
│ ├── Cargo.toml
│ └── src
│ └── lib.rs
└── foo
├── Cargo.toml
└── src
└── main.rs
The main.rs is the root source file for your new foo project -- nothing new there. The Cargo.toml is the
config file for cargo for this project. If you look inside it, you should see something like this:
[package]
name = "foo"
version = "0.1.0"
authors = ["mark"]
[dependencies]
The name field under [package] determines the name of the project. This is used by crates.io if you
publish the crate (more later). It is also the name of the output binary when you compile.
The authors field is a list of authors used when publishing the crate.
The [dependencies] section lets you add dependencies for your project.
For example, suppose that we want our program to have a great CLI. You can find lots of great packages on
crates.io (the official Rust package registry). One popular choice is clap. As of this writing, the most recent
published version of clap is 2.27.1 . To add a dependency to our program, we can simply add the
following to our Cargo.toml under [dependencies] : clap = "2.27.1" . And that's it! You can start using196
clap in your program.
cargo also supports other types of dependencies. Here is just a small sampling:
[package]
name = "foo"
version = "0.1.0"
authors = ["mark"]
[dependencies]
clap = "2.27.1" # from crates.io
rand = { git = "https://github.com/rust-lang-nursery/rand" } # from online repo
bar = { path = "../bar" } # from a path in the local filesystem
cargo is more than a dependency manager. All of the available configuration options are listed in the
format specification of Cargo.toml .
To build our project we can execute cargo build anywhere in the project directory (including
subdirectories!). We can also do cargo run to build and run. Notice that these commands will resolve all
dependencies, download crates if needed, and build everything, including your crate. (Note that it only
rebuilds what it has not already built, similar to make ).
Conventions
In the previous chapter, we saw the following directory hierarchy:
foo
├── Cargo.toml
└── src
└── main.rs
Suppose that we wanted to have two binaries in the same project, though. What then?
It turns out that cargo supports this. The default binary name is main , as we saw before, but you can add
additional binaries by placing them in a bin/ directory:
foo
├── Cargo.toml
└── src
├── main.rs
└── bin
└── my_other_bin.rs
To tell cargo to only compile or run this binary, we just pass cargo the --bin my_other_bin flag, where
my_other_bin is the name of the binary we want to work with.
In addition to extra binaries, cargo supports more features such as benchmarks, tests, and examples.
Testing
As we know testing is integral to any piece of software! Rust has first-class support for unit and integration
testing (see this chapter in TRPL).
From the testing chapters linked above, we see how to write unit tests and integration tests.
Organizationally, we can place unit tests in the modules they test and integration tests in their own tests/
directory:
foo
├── Cargo.toml
├── src
│ └── main.rs
│ └── lib.rs
└── tests
├── my_test.rs
└── my_other_test.rs
Each file in tests is a separate integration test, i.e. a test that is meant to test your library as if it were
being called from a dependent crate.
The Testing chapter elaborates on the three different testing styles: Unit, Doc, and Integration.
$ cargo test
$ cargo test
Compiling blah v0.1.0 (file:///nobackup/blah)
Finished dev [unoptimized + debuginfo] target(s) in 0.89 secs
Running target/debug/deps/blah-d3b32b97275ec472
running 4 tests
test test_bar ... ok
test test_baz ... ok
test test_foo_bar ... ok
test test_foo ... ok
running 2 tests
test test_foo ... ok
test test_foo_bar ... ok
One word of caution: Cargo may run multiple tests concurrently, so make sure that they don't race with
each other.
One example of this concurrency causing issues is if two tests output to a file, such as below:
#[cfg(test)]
mod tests {
// Import the necessary modules
use std::fs::OpenOptions;
use std::io::Write;
Build Scripts
Sometimes a normal build from cargo is not enough. Perhaps your crate needs some pre-requisites
before cargo will successfully compile, things like code generation, or some native code that needs to be
compiled. To solve this problem we have build scripts that Cargo can run.
To add a build script to your package it can either be specified in the Cargo.toml as follows:
[package]
...
build = "build.rs"
Otherwise Cargo will look for a build.rs file in the project directory by default.
Cargo provides the script with inputs via environment variables specified here that can be used.
The script provides output via stdout. All lines printed are written to target/debug/build/<pkg>/output .
Further, lines prefixed with cargo: will be interpreted by Cargo directly and hence can be used to define
parameters for the package's compilation.
For further specification and examples have a read of the Cargo specification.
205
206
Attributes
An attribute is metadata applied to some module, crate or item. This metadata can be used to/for:
Attributes look like #[outer_attribute] or #![inner_attribute] , with the difference between them
being where they apply.
#[outer_attribute] applies to the item immediately following it. Some examples of items are: a
function, a module declaration, a constant, a structure, an enum. Here is an example where attribute #
[derive(Debug)] applies to the struct Rectangle :
#[derive(Debug)]
struct Rectangle {
width: u32,
height: u32,
}
#![inner_attribute] applies to the enclosing item (typically a module or a crate). In other words,
this attribute is interpreted as applying to the entire scope in which it's placed. Here is an example
where #![allow(unused_variables)] applies to the whole crate (if placed in main.rs ):
#![allow(unused_variables)]
fn main() {
let x = 3; // This would normally warn about an unused variable.
}
#[attribute = "value"]
#[attribute(key = "value")]
#[attribute(value)]
Attributes can have multiple values and can be separated over multiple lines, too:
#[attribute(value, value2)] 207
dead_code
The compiler provides a dead_code lint that will warn about unused functions. An attribute can be used to
disable the lint.
1 fn used_function() {}
2
3 // `#[allow(dead_code)]` is an attribute that disables the `dead_code` lint
4 #[allow(dead_code)]
5 fn unused_function() {}
6
7 fn noisy_unused_function() {}
8 // FIXME ^ Add an attribute to suppress the warning
9
10 fn main() {
11 used_function();
12 }
Note that in real programs, you should eliminate dead code. In these examples we'll allow dead code in
some places because of the interactive nature of the examples.
210
211
Crates
The crate_type attribute can be used to tell the compiler whether a crate is a binary or a library (and even
which type of library), and the crate_name attribute can be used to set the name of the crate.
However, it is important to note that both the crate_type and crate_name attributes have no effect
whatsoever when using Cargo, the Rust package manager. Since Cargo is used for the majority of Rust
projects, this means real-world uses of crate_type and crate_name are relatively limited.
When the crate_type attribute is used, we no longer need to pass the --crate-type flag to rustc .
$ rustc lib.rs
$ ls lib*
library.rlib
212
213
cfg
Configuration conditional checks are possible through two different operators:
While the former enables conditional compilation, the latter conditionally evaluates to true or false
literals allowing for checks at run-time. Both utilize identical argument syntax.
cfg! , unlike #[cfg] , does not remove any code and only evaluates to true or false. For example, all
blocks in an if/else expression need to be valid when cfg! is used for the condition, regardless of what
cfg! is evaluating.
See also:
Custom
Some conditionals like target_os are implicitly provided by rustc , but custom conditionals must be
passed to rustc using the --cfg flag.
1 #[cfg(some_condition)]
2 fn conditional_function() {
3 println!("condition met!");
4 }
5
6 fn main() {
7 conditional_function();
8 }
Try to run this to see what happens without the custom cfg flag.
Generics
Generics is the topic of generalizing types and functionalities to broader cases. This is extremely useful for
reducing code duplication in many ways, but can call for rather involved syntax. Namely, being generic
requires taking great care to specify over which types a generic type is actually considered valid. The
simplest and most common use of generics is for type parameters.
A type parameter is specified as generic by the use of angle brackets and upper camel case: <Aaa, Bbb,
...> . "Generic type parameters" are typically represented as <T> . In Rust, "generic" also describes
anything that accepts one or more generic type parameters <T> . Any type specified as a generic type
parameter is generic, and everything else is concrete (non-generic).
For example, defining a generic function named foo that takes an argument T of any type:
fn foo<T>(arg: T) { ... }
Because T has been specified as a generic type parameter using <T> , it is considered generic when used
here as (arg: T) . This is the case even if T has previously been defined as a struct .
See also:
structs
218
219
Functions
The same set of rules can be applied to functions: a type T becomes generic when preceded by <T> .
Using generic functions sometimes requires explicitly specifying type parameters. This may be the case if
the function is called where the return type is generic, or if the compiler doesn't have enough information to
infer the necessary type parameters.
A function call with explicitly specified type parameters looks like: fun::<A, B, ...>() .
See also:
Implementation
Similar to functions, implementations require care to remain generic.
1 struct Val {
2 val: f64,
3 }
4
5 struct GenVal<T> {
6 gen_val: T,
7 }
8
9 // impl of Val
10 impl Val {
11 fn value(&self) -> &f64 {
12 &self.val
13 }
14 }
15
16 // impl of GenVal for a generic type `T`
17 impl<T> GenVal<T> {
18 fn value(&self) -> &T {
19 &self.gen_val
20 }
21 }
22
23 fn main() {
24 let x = Val { val: 3.0 };
25 let y = GenVal { gen_val: 3i32 };
26
27 println!("{}, {}", x.value(), y.value());
28 }
See also:
Traits
Of course trait s can also be generic. Here we define one which reimplements the Drop trait as a
generic method to drop itself and an input.
1 // Non-copyable types.
2 struct Empty;
3 struct Null;
4
5 // A trait generic over `T`.
6 trait DoubleDrop<T> {
7 // Define a method on the caller type which takes an
8 // additional single parameter `T` and does nothing with it.
9 fn double_drop(self, _: T);
10 }
11
12 // Implement `DoubleDrop<T>` for any generic parameter `T` and
13 // caller `U`.
14 impl<T, U> DoubleDrop<T> for U {
15 // This method takes ownership of both passed arguments,
16 // deallocating both.
17 fn double_drop(self, _: T) {}
18 }
19
20 fn main() {
21 let empty = Empty;
22 let null = Null;
23
24 // Deallocate `empty` and `null`.
25 empty.double_drop(null);
26
27 //empty;
28 //null;
29 // ^ TODO: Try uncommenting these lines.
30 }
See also:
Bounds
When working with generics, the type parameters often must use traits as bounds to stipulate what
functionality a type implements. For example, the following example uses the trait Display to print and so
it requires T to be bound by Display ; that is, T must implement Display .
Bounding restricts the generic to types that conform to the bounds. That is:
Another effect of bounding is that generic instances are allowed to access the methods of traits specified in
the bounds. For example:
1 // A trait which implements the print marker: `{:?}`. 226
2 use std::fmt::Debug;
3
4 trait HasArea {
5 fn area(&self) -> f64;
6 }
7
8 impl HasArea for Rectangle {
9 fn area(&self) -> f64 { self.length * self.height }
10 }
11
12 #[derive(Debug)]
13 struct Rectangle { length: f64, height: f64 }
14 #[allow(dead_code)]
15 struct Triangle { length: f64, height: f64 }
16
17 // The generic `T` must implement `Debug`. Regardless
18 // of the type, this will work properly.
19 fn print_debug<T: Debug>(t: &T) {
20 println!("{:?}", t);
21 }
22
23 // `T` must implement `HasArea`. Any type which meets
24 // the bound can access `HasArea`'s function `area`.
25 fn area<T: HasArea>(t: &T) -> f64 { t.area() }
26
27 fn main() {
28 let rectangle = Rectangle { length: 3.0, height: 4.0 };
29 let _triangle = Triangle { length: 3.0, height: 4.0 };
30
31 print_debug(&rectangle);
32 println!("Area: {}", area(&rectangle));
33
34 //print_debug(&_triangle);
35 //println!("Area: {}", area(&_triangle));
36 // ^ TODO: Try uncommenting these.
37 // | Error: Does not implement either `Debug` or `HasArea`.
38 }
As an additional note, where clauses can also be used to apply bounds in some cases to be more
expressive.
See also:
1 struct Cardinal;
2 struct BlueJay;
3 struct Turkey;
4
5 trait Red {}
6 trait Blue {}
7
8 impl Red for Cardinal {}
9 impl Blue for BlueJay {}
10
11 // These functions are only valid for types which implement these
12 // traits. The fact that the traits are empty is irrelevant.
13 fn red<T: Red>(_: &T) -> &'static str { "red" }
14 fn blue<T: Blue>(_: &T) -> &'static str { "blue" }
15
16 fn main() {
17 let cardinal = Cardinal;
18 let blue_jay = BlueJay;
19 let _turkey = Turkey;
20
21 // `red()` won't work on a blue jay nor vice versa
22 // because of the bounds.
23 println!("A cardinal is {}", red(&cardinal));
24 println!("A blue jay is {}", blue(&blue_jay));
25 //println!("A turkey is {}", red(&_turkey));
26 // ^ TODO: Try uncommenting this line.
27 }
See also:
Multiple bounds
Multiple bounds for a single type can be applied with a + . Like normal, different types are separated with
,.
See also:
Where clauses
A bound can also be expressed using a where clause immediately before the opening { , rather than at the
type's first mention. Additionally, where clauses can apply bounds to arbitrary types, rather than just to
type parameters.
impl <A: TraitB + TraitC, D: TraitE + TraitF> MyTrait<A, D> for YourType {}
When using a where clause is more expressive than using normal syntax. The impl in this example
cannot be directly expressed without a where clause:
1 use std::fmt::Debug;
2
3 trait PrintInOption {
4 fn print_in_option(self);
5 }
6
7 // Because we would otherwise have to express this as `T: Debug` or
8 // use another method of indirect approach, this requires a `where` clause:
9 impl<T> PrintInOption for T where
10 Option<T>: Debug {
11 // We want `Option<T>: Debug` as our bound because that is what's
12 // being printed. Doing otherwise would be using the wrong bound.
13 fn print_in_option(self) {
14 println!("{:?}", Some(self));
15 }
16 }
17
18 fn main() {
19 let vec = vec![1, 2, 3];
20
21 vec.print_in_option();
22 }
See also:
For example, an age verification function that checks age in years, must be given a value of type Years .
1 struct Years(i64);
2
3 struct Days(i64);
4
5 impl Years {
6 pub fn to_days(&self) -> Days {
7 Days(self.0 * 365)
8 }
9 }
10
11
12 impl Days {
13 /// truncates partial years
14 pub fn to_years(&self) -> Years {
15 Years(self.0 / 365)
16 }
17 }
18
19 fn is_adult(age: &Years) -> bool {
20 age.0 >= 18
21 }
22
23 fn main() {
24 let age = Years(25);
25 let age_days = age.to_days();
26 println!("Is an adult? {}", is_adult(&age));
27 println!("Is an adult? {}", is_adult(&age_days.to_years()));
28 // println!("Is an adult? {}", is_adult(&age_days));
29 }
Uncomment the last print statement to observe that the type supplied must be Years .
To obtain the newtype 's value as the base type, you may use the tuple or destructuring syntax like so:
1 struct Years(i64);
2
3 fn main() {
4 let years = Years(42);
5 let years_as_primitive_1: i64 = years.0; // Tuple
6 let Years(years_as_primitive_2) = years; // Destructuring
7 }
See also:
structs
235
236
Associated items
"Associated Items" refers to a set of rules pertaining to item s of various types. It is an extension to trait
generics, and allows trait s to internally define new items.
One such item is called an associated type, providing simpler usage patterns when the trait is generic
over its container type.
See also:
RFC
237
238
The Problem
A trait that is generic over its container type has type specification requirements - users of the trait
must specify all of its generic types.
In the example below, the Contains trait allows the use of the generic types A and B . The trait is then
implemented for the Container type, specifying i32 for A and B so that it can be used with fn
difference() .
Because Contains is generic, we are forced to explicitly state all of the generic types for fn
difference() . In practice, we want a way to express that A and B are determined by the input C . As you
will see in the next section, associated types provide exactly that capability.
Associated types
The use of "Associated types" improves the overall readability of code by moving inner types locally into a
trait as output types. Syntax for the trait definition is as follows:
// `A` and `B` are defined in the trait via the `type` keyword.
// (Note: `type` in this context is different from `type` when used for
// aliases).
trait Contains {
type A;
type B;
Note that functions that use the trait Contains are no longer required to express A or B at all:
Let's rewrite the example from the previous section using associated types:
1 struct Container(i32, i32); 242
2
3 // A trait which checks if 2 items are stored inside of container.
4 // Also retrieves first or last value.
5 trait Contains {
6 // Define generic types here which methods will be able to utilize.
7 type A;
8 type B;
9
10 fn contains(&self, _: &Self::A, _: &Self::B) -> bool;
11 fn first(&self) -> i32;
12 fn last(&self) -> i32;
13 }
14
15 impl Contains for Container {
16 // Specify what types `A` and `B` are. If the `input` type
17 // is `Container(i32, i32)`, the `output` types are determined
18 // as `i32` and `i32`.
19 type A = i32;
20 type B = i32;
21
22 // `&Self::A` and `&Self::B` are also valid here.
23 fn contains(&self, number_1: &i32, number_2: &i32) -> bool {
24 (&self.0 == number_1) && (&self.1 == number_2)
25 }
26 // Grab the first number.
27 fn first(&self) -> i32 { self.0 }
28
29 // Grab the last number.
30 fn last(&self) -> i32 { self.1 }
31 }
32
33 fn difference<C: Contains>(container: &C) -> i32 {
34 container.last() - container.first()
35 }
36
37 fn main() {
38 let number_1 = 3;
39 let number_2 = 10;
40
41 let container = Container(number_1, number_2);
42
43 println!("Does container contain {} and {}: {}",
44 &number_1, &number_2,
45 container.contains(&number_1, &number_2));
46 println!("First number: {}", container.first());
47 println!("Last number: {}", container.last());
48
49 println!("The difference is: {}", difference(&container));
50 }
243
244
Data types can use extra generic type parameters to act as markers or to perform type checking at compile
time. These extra parameters hold no storage values, and have no runtime behavior.
In the following example, we combine std::marker::PhantomData with the phantom type parameter concept
to create tuples containing different data types.
1 use std::marker::PhantomData;
2
3 // A phantom tuple struct which is generic over `A` with hidden parameter `B`.
4 #[derive(PartialEq)] // Allow equality test for this type.
5 struct PhantomTuple<A, B>(A, PhantomData<B>);
6
7 // A phantom type struct which is generic over `A` with hidden parameter `B`.
8 #[derive(PartialEq)] // Allow equality test for this type.
9 struct PhantomStruct<A, B> { first: A, phantom: PhantomData<B> }
10
11 // Note: Storage is allocated for generic type `A`, but not for `B`.
12 // Therefore, `B` cannot be used in computations.
13
14 fn main() {
15 // Here, `f32` and `f64` are the hidden parameters.
16 // PhantomTuple type specified as `<char, f32>`.
17 let _tuple1: PhantomTuple<char, f32> = PhantomTuple('Q', PhantomData);
18 // PhantomTuple type specified as `<char, f64>`.
19 let _tuple2: PhantomTuple<char, f64> = PhantomTuple('Q', PhantomData);
20
21 // Type specified as `<char, f32>`.
22 let _struct1: PhantomStruct<char, f32> = PhantomStruct {
23 first: 'Q',
24 phantom: PhantomData,
25 };
26 // Type specified as `<char, f64>`.
27 let _struct2: PhantomStruct<char, f64> = PhantomStruct {
28 first: 'Q',
29 phantom: PhantomData,
30 };
31
32 // Compile-time Error! Type mismatch so these cannot be compared:
33 // println!("_tuple1 == _tuple2 yields: {}",
34 // _tuple1 == _tuple2);
35
36 // Compile-time Error! Type mismatch so these cannot be compared:
37 // println!("_struct1 == _struct2 yields: {}",
38 // _struct1 == _struct2);
39 }
See also:
See also:
Borrowing ( & ), Bounds ( X: Y ), enum, impl & self, Overloading, ref, Traits ( X for Y ), and TupleStructs.
248
249
Scoping rules
Scopes play an important part in ownership, borrowing, and lifetimes. That is, they indicate to the compiler
when borrows are valid, when resources can be freed, and when variables are created or destroyed.
250
251
RAII
Variables in Rust do more than just hold data in the stack: they also own resources, e.g. Box<T> owns
memory in the heap. Rust enforces RAII (Resource Acquisition Is Initialization), so whenever an object goes
out of scope, its destructor is called and its owned resources are freed.
This behavior shields against resource leak bugs, so you'll never have to manually free memory or worry
about memory leaks again! Here's a quick showcase:
1 // raii.rs
2 fn create_box() {
3 // Allocate an integer on the heap
4 let _box1 = Box::new(3i32);
5
6 // `_box1` is destroyed here, and memory gets freed
7 }
8
9 fn main() {
10 // Allocate an integer on the heap
11 let _box2 = Box::new(5i32);
12
13 // A nested scope:
14 {
15 // Allocate an integer on the heap
16 let _box3 = Box::new(4i32);
17
18 // `_box3` is destroyed here, and memory gets freed
19 }
20
21 // Creating lots of boxes just for fun
22 // There's no need to manually free memory!
23 for _ in 0u32..1_000 {
24 create_box();
25 }
26
27 // `_box2` is destroyed here, and memory gets freed
28 }
No leaks here!
Destructor 252
The notion of a destructor in Rust is provided through the Drop trait. The destructor is called when the
resource goes out of scope. This trait is not required to be implemented for every type, only implement it for
your type if you require its own destructor logic.
Run the below example to see how the Drop trait works. When the variable in the main function goes out
of scope the custom destructor will be invoked.
1 struct ToDrop;
2
3 impl Drop for ToDrop {
4 fn drop(&mut self) {
5 println!("ToDrop is being dropped");
6 }
7 }
8
9 fn main() {
10 let x = ToDrop;
11 println!("Made a ToDrop!");
12 }
See also:
Box
253
254
When doing assignments ( let x = y ) or passing function arguments by value ( foo(x) ), the ownership
of the resources is transferred. In Rust-speak, this is known as a move.
After moving resources, the previous owner can no longer be used. This avoids creating dangling pointers.
Mutability
Mutability of data can be changed when ownership is transferred.
1 fn main() {
2 let immutable_box = Box::new(5u32);
3
4 println!("immutable_box contains {}", immutable_box);
5
6 // Mutability error
7 //*immutable_box = 4;
8
9 // *Move* the box, changing the ownership (and mutability)
10 let mut mutable_box = immutable_box;
11
12 println!("mutable_box contains {}", mutable_box);
13
14 // Modify the contents of the box
15 *mutable_box = 4;
16
17 println!("mutable_box now contains {}", mutable_box);
18 }
257
258
Partial moves
Within the destructuring of a single variable, both by-move and by-reference pattern bindings can be
used at the same time. Doing this will result in a partial move of the variable, which means that parts of the
variable will be moved while other parts stay. In such a case, the parent variable cannot be used afterwards
as a whole, however the parts that are only referenced (and not moved) can still be used.
1 fn main() {
2 #[derive(Debug)]
3 struct Person {
4 name: String,
5 age: Box<u8>,
6 }
7
8 let person = Person {
9 name: String::from("Alice"),
10 age: Box::new(20),
11 };
12
13 // `name` is moved out of person, but `age` is referenced
14 let Person { name, ref age } = person;
15
16 println!("The person's age is {}", age);
17
18 println!("The person's name is {}", name);
19
20 // Error! borrow of partially moved value: `person` partial move occurs
21 //println!("The person struct is {:?}", person);
22
23 // `person` cannot be used but `person.age` can be used as it is not moved
24 println!("The person's age from person struct is {}", person.age);
25 }
(In this example, we store the age variable on the heap to illustrate the partial move: deleting ref in the
above code would give an error as the ownership of person.age would be moved to the variable age . If
Person.age were stored on the stack, ref would not be required as the definition of age would copy the
data from person.age without moving it.)
See also:
destructuring
259
260
Borrowing
Most of the time, we'd like to access data without taking ownership over it. To accomplish this, Rust uses a
borrowing mechanism. Instead of passing objects by value ( T ), objects can be passed by reference ( &T ).
The compiler statically guarantees (via its borrow checker) that references always point to valid objects.
That is, while references to an object exist, the object cannot be destroyed.
Mutability
Mutable data can be mutably borrowed using &mut T . This is called a mutable reference and gives
read/write access to the borrower. In contrast, &T borrows the data via an immutable reference, and the
borrower can read the data but not modify it:
1 #[allow(dead_code)]
2 #[derive(Clone, Copy)]
3 struct Book {
4 // `&'static str` is a reference to a string allocated in read only memory
5 author: &'static str,
6 title: &'static str,
7 year: u32,
8 }
9
10 // This function takes a reference to a book
11 fn borrow_book(book: &Book) {
12 println!("I immutably borrowed {} - {} edition", book.title, book.year);
13 }
14
15 // This function takes a reference to a mutable book and changes `year` to 2014
16 fn new_edition(book: &mut Book) {
17 book.year = 2014;
18 println!("I mutably borrowed {} - {} edition", book.title, book.year);
19 }
20
21 fn main() {
22 // Create an immutable Book named `immutabook`
23 let immutabook = Book {
24 // string literals have type `&'static str`
25 author: "Douglas Hofstadter",
26 title: "Gödel, Escher, Bach",
27 year: 1979,
28 };
29
30 // Create a mutable copy of `immutabook` and call it `mutabook`
31 let mut mutabook = immutabook;
32
33 // Immutably borrow an immutable object
34 borrow_book(&immutabook);
35
36 // Immutably borrow a mutable object
37 borrow_book(&mutabook);
38
39 // Borrow a mutable object as mutable
40 new_edition(&mut mutabook);
41
42 // Error! Cannot borrow an immutable object as mutable
43 new_edition(&mut immutabook);
44 // FIXME ^ Comment out this line
45 }
See also:
static
263
264
Aliasing
Data can be immutably borrowed any number of times, but while immutably borrowed, the original data
can't be mutably borrowed. On the other hand, only one mutable borrow is allowed at a time. The original
data can be borrowed again only after the mutable reference has been used for the last time.
1 #[derive(Clone, Copy)]
2 struct Point { x: i32, y: i32 }
3
4 fn main() {
5 let c = 'Q';
6
7 // A `ref` borrow on the left side of an assignment is equivalent to
8 // an `&` borrow on the right side.
9 let ref ref_c1 = c;
10 let ref_c2 = &c;
11
12 println!("ref_c1 equals ref_c2: {}", *ref_c1 == *ref_c2);
13
14 let point = Point { x: 0, y: 0 };
15
16 // `ref` is also valid when destructuring a struct.
17 let _copy_of_x = {
18 // `ref_to_x` is a reference to the `x` field of `point`.
19 let Point { x: ref ref_to_x, y: _ } = point;
20
21 // Return a copy of the `x` field of `point`.
22 *ref_to_x
23 };
24
25 // A mutable copy of `point`
26 let mut mutable_point = point;
27
28 {
29 // `ref` can be paired with `mut` to take mutable references.
30 let Point { x: _, y: ref mut mut_ref_to_y } = mutable_point;
31
32 // Mutate the `y` field of `mutable_point` via a mutable reference.
33 *mut_ref_to_y = 1;
34 }
35
36 println!("point is ({}, {})", point.x, point.y);
37 println!("mutable_point is ({}, {})", mutable_point.x, mutable_point.y);
38
39 // A mutable tuple that includes a pointer
40 let mut mutable_tuple = (Box::new(5u32), 3u32);
41
42 {
43 // Destructure `mutable_tuple` to change the value of `last`.
44 let (_, ref mut last) = mutable_tuple;
45 *last = 2u32;
46 }
47
48 println!("tuple is {:?}", mutable_tuple);
49 }
267
268
Lifetimes
A lifetime is a construct the compiler (or more specifically, its borrow checker) uses to ensure all borrows
are valid. Specifically, a variable's lifetime begins when it is created and ends when it is destroyed. While
lifetimes and scopes are often referred to together, they are not the same.
Take, for example, the case where we borrow a variable via & . The borrow has a lifetime that is determined
by where it is declared. As a result, the borrow is valid as long as it ends before the lender is destroyed.
However, the scope of the borrow is determined by where the reference is used.
In the following example and in the rest of this section, we will see how lifetimes relate to scopes, as well
as how the two differ.
Note that no names or types are assigned to label lifetimes. This restricts how lifetimes will be able to be
used as we will see.
269
270
Explicit annotation
The borrow checker uses explicit lifetime annotations to determine how long references should be valid. In
cases where lifetimes are not elided1, Rust requires explicit annotations to determine what the lifetime of a
reference should be. The syntax for explicitly annotating a lifetime uses an apostrophe character as follows:
foo<'a>
// `foo` has a lifetime parameter `'a`
Similar to closures, using lifetimes requires generics. Additionally, this lifetime syntax indicates that the
lifetime of foo may not exceed that of 'a . Explicit annotation of a type has the form &'a T where 'a has
already been introduced.
foo<'a, 'b>
// `foo` has lifetime parameters `'a` and `'b`
In this case, the lifetime of foo cannot exceed that of either 'a or 'b .
See also:
Functions
Ignoring elision, function signatures with lifetimes have a few constraints:
Additionally, note that returning references without input is banned if it would result in returning references
to invalid data. The following example shows off some valid forms of functions with lifetimes:
See also:
Functions
274
275
Methods
Methods are annotated similarly to functions:
1 struct Owner(i32);
2
3 impl Owner {
4 // Annotate lifetimes as in a standalone function.
5 fn add_one<'a>(&'a mut self) { self.0 += 1; }
6 fn print<'a>(&'a self) {
7 println!("`print`: {}", self.0);
8 }
9 }
10
11 fn main() {
12 let mut owner = Owner(18);
13
14 owner.add_one();
15 owner.print();
16 }
See also:
methods
276
277
Structs
Annotation of lifetimes in structures are also similar to functions:
See also:
struct s
278
279
Traits
Annotation of lifetimes in trait methods basically are similar to functions. Note that impl may have
annotation of lifetimes too.
See also:
trait s
280
281
Bounds
Just like generic types can be bounded, lifetimes (themselves generic) use bounds as well. The : character
has a slightly different meaning here, but + is the same. Note how the following read:
The example below shows the above syntax in action used after keyword where :
See also:
Coercion
A longer lifetime can be coerced into a shorter one so that it works inside a scope it normally wouldn't work
in. This comes in the form of inferred coercion by the Rust compiler, and also in the form of declaring a
lifetime difference:
Static
Rust has a few reserved lifetime names. One of those is 'static . You might encounter it in two situations:
Both are related but subtly different and this is a common source for confusion when learning Rust. Here
are some examples for each situation:
Reference lifetime
As a reference lifetime 'static indicates that the data pointed to by the reference lives for the remaining
lifetime of the running program. It can still be coerced to a shorter lifetime.
There are two common ways to make a variable with 'static lifetime, and both are stored in the read-
only memory of the binary:
Since 'static references only need to be valid for the remainder of a program's life, they can be created
while the program is executed. Just to demonstrate, the below example uses Box::leak to dynamically
create 'static references. In that case it definitely doesn't live for the entire duration, but only for the
leaking point onward.
Trait bound
As a trait bound, it means the type does not contain any non-static references. Eg. the receiver can hold on
to the type for as long as they want and it will never become invalid until they drop it.
It's important to understand this means that any owned data always passes a 'static lifetime bound, but
287
a reference to that owned data generally does not:
1 use std::fmt::Debug;
2
3 fn print_it( input: impl Debug + 'static ) {
4 println!( "'static value passed in is: {:?}", input );
5 }
6
7 fn main() {
8 // i is owned and contains no references, thus it's 'static:
9 let i = 5;
10 print_it(i);
11
12 // oops, &i only has the lifetime defined by the scope of
13 // main(), so it's not 'static:
14 print_it(&i);
15 }
See also:
'static constants
288
289
Elision
Some lifetime patterns are overwhelmingly common and so the borrow checker will allow you to omit
them to save typing and to improve readability. This is known as elision. Elision exists in Rust solely
because these patterns are common.
The following code shows a few examples of elision. For a more comprehensive description of elision, see
lifetime elision in the book.
See also:
elision
290
291
Traits
A trait is a collection of methods defined for an unknown type: Self . They can access other methods
declared in the same trait.
Traits can be implemented for any data type. In the example below, we define Animal , a group of methods.
The Animal trait is then implemented for the Sheep data type, allowing the use of methods from
Animal with a Sheep .
1 struct Sheep { naked: bool, name: &'static str } 292
2
3 trait Animal {
4 // Associated function signature; `Self` refers to the implementor type.
5 fn new(name: &'static str) -> Self;
6
7 // Method signatures; these will return a string.
8 fn name(&self) -> &'static str;
9 fn noise(&self) -> &'static str;
10
11 // Traits can provide default method definitions.
12 fn talk(&self) {
13 println!("{} says {}", self.name(), self.noise());
14 }
15 }
16
17 impl Sheep {
18 fn is_naked(&self) -> bool {
19 self.naked
20 }
21
22 fn shear(&mut self) {
23 if self.is_naked() {
24 // Implementor methods can use the implementor's trait methods.
25 println!("{} is already naked...", self.name());
26 } else {
27 println!("{} gets a haircut!", self.name);
28
29 self.naked = true;
30 }
31 }
32 }
33
34 // Implement the `Animal` trait for `Sheep`.
35 impl Animal for Sheep {
36 // `Self` is the implementor type: `Sheep`.
37 fn new(name: &'static str) -> Sheep {
38 Sheep { name: name, naked: false }
39 }
40
41 fn name(&self) -> &'static str {
42 self.name
43 }
44
45 fn noise(&self) -> &'static str {
46 if self.is_naked() {
47 "baaaaah?"
48 } else {
49 "baaaaah!"
50 }
51 }
52
53 // Default trait methods can be overridden.
54 fn talk(&self) {
55 // For example, we can add some quiet contemplation.
56 println!("{} pauses briefly... {}", self.name, self.noise());
57 }
58 }
59
60 fn main() {
61 // Type annotation is necessary in this case.
62 let mut dolly: Sheep = Animal::new("Dolly");
63 // TODO ^ Try removing the type annotations.
64 293
65 dolly.talk();
66 dolly.shear();
67 dolly.talk();
68 }
294
295
Derive
The compiler is capable of providing basic implementations for some traits via the #[derive] attribute.
These traits can still be manually implemented if a more complex behavior is required.
See also:
derive
297
298
However, there's an easy workaround. Instead of returning a trait object directly, our functions return a Box
which contains some Animal . A box is just a reference to some memory in the heap. Because a reference
has a statically-known size, and the compiler can guarantee it points to a heap-allocated Animal , we can
return a trait from our function!
Rust tries to be as explicit as possible whenever it allocates memory on the heap. So if your function
returns a pointer-to-trait-on-heap in this way, you need to write the return type with the dyn keyword, e.g.
Box<dyn Animal> .
1 struct Sheep {}
2 struct Cow {}
3
4 trait Animal {
5 // Instance method signature
6 fn noise(&self) -> &'static str;
7 }
8
9 // Implement the `Animal` trait for `Sheep`.
10 impl Animal for Sheep {
11 fn noise(&self) -> &'static str {
12 "baaaaah!"
13 }
14 }
15
16 // Implement the `Animal` trait for `Cow`.
17 impl Animal for Cow {
18 fn noise(&self) -> &'static str {
19 "moooooo!"
20 }
21 }
22
23 // Returns some struct that implements Animal, but we don't know which one at compile ti
24 fn random_animal(random_number: f64) -> Box<dyn Animal> {
25 if random_number < 0.5 {
26 Box::new(Sheep {})
27 } else {
28 Box::new(Cow {})
29 }
30 }
31
32 fn main() {
33 let random_number = 0.234;
34 let animal = random_animal(random_number);
35 println!("You've randomly chosen an animal, and it says {}", animal.noise());
36 }
37
299
300
Operator Overloading
In Rust, many of the operators can be overloaded via traits. That is, some operators can be used to
accomplish different tasks based on their input arguments. This is possible because operators are syntactic
sugar for method calls. For example, the + operator in a + b calls the add method (as in a.add(b) ).
This add method is part of the Add trait. Hence, the + operator can be used by any implementor of the
Add trait.
A list of the traits, such as Add , that overload operators can be found in core::ops .
1 use std::ops;
2
3 struct Foo;
4 struct Bar;
5
6 #[derive(Debug)]
7 struct FooBar;
8
9 #[derive(Debug)]
10 struct BarFoo;
11
12 // The `std::ops::Add` trait is used to specify the functionality of `+`.
13 // Here, we make `Add<Bar>` - the trait for addition with a RHS of type `Bar`.
14 // The following block implements the operation: Foo + Bar = FooBar
15 impl ops::Add<Bar> for Foo {
16 type Output = FooBar;
17
18 fn add(self, _rhs: Bar) -> FooBar {
19 println!("> Foo.add(Bar) was called");
20
21 FooBar
22 }
23 }
24
25 // By reversing the types, we end up implementing non-commutative addition.
26 // Here, we make `Add<Foo>` - the trait for addition with a RHS of type `Foo`.
27 // This block implements the operation: Bar + Foo = BarFoo
28 impl ops::Add<Foo> for Bar {
29 type Output = BarFoo;
30
31 fn add(self, _rhs: Foo) -> BarFoo {
32 println!("> Bar.add(Foo) was called");
33
34 BarFoo
35 }
36 }
37
38 fn main() {
39 println!("Foo + Bar = {:?}", Foo + Bar);
40 println!("Bar + Foo = {:?}", Bar + Foo);
41 }
See Also
Drop
The Drop trait only has one method: drop , which is called automatically when an object goes out of
scope. The main use of the Drop trait is to free the resources that the implementor instance owns.
Box , Vec , String , File , and Process are some examples of types that implement the Drop trait to free
resources. The Drop trait can also be manually implemented for any custom data type.
The following example adds a print to console to the drop function to announce when it is called.
1 struct Droppable {
2 name: &'static str,
3 }
4
5 // This trivial implementation of `drop` adds a print to console.
6 impl Drop for Droppable {
7 fn drop(&mut self) {
8 println!("> Dropping {}", self.name);
9 }
10 }
11
12 fn main() {
13 let _a = Droppable { name: "a" };
14
15 // block A
16 {
17 let _b = Droppable { name: "b" };
18
19 // block B
20 {
21 let _c = Droppable { name: "c" };
22 let _d = Droppable { name: "d" };
23
24 println!("Exiting block B");
25 }
26 println!("Just exited block B");
27
28 println!("Exiting block A");
29 }
30 println!("Just exited block A");
31
32 // Variable can be manually dropped using the `drop` function
33 drop(_a);
34 // TODO ^ Try commenting this line
35
36 println!("end of the main function");
37
38 // `_a` *won't* be `drop`ed again here, because it already has been
39 // (manually) `drop`ed
40 }
303
304
Iterators
The Iterator trait is used to implement iterators over collections such as arrays.
The trait requires only a method to be defined for the next element, which may be manually defined in an
impl block or automatically defined (as in arrays and ranges).
As a point of convenience for common situations, the for construct turns some collections into iterators
using the .into_iter() method.
1 struct Fibonacci { 305
2 curr: u32,
3 next: u32,
4 }
5
6 // Implement `Iterator` for `Fibonacci`.
7 // The `Iterator` trait only requires a method to be defined for the `next` element.
8 impl Iterator for Fibonacci {
9 // We can refer to this type using Self::Item
10 type Item = u32;
11
12 // Here, we define the sequence using `.curr` and `.next`.
13 // The return type is `Option<T>`:
14 // * When the `Iterator` is finished, `None` is returned.
15 // * Otherwise, the next value is wrapped in `Some` and returned.
16 // We use Self::Item in the return type, so we can change
17 // the type without having to update the function signatures.
18 fn next(&mut self) -> Option<Self::Item> {
19 let current = self.curr;
20
21 self.curr = self.next;
22 self.next = current + self.next;
23
24 // Since there's no endpoint to a Fibonacci sequence, the `Iterator`
25 // will never return `None`, and `Some` is always returned.
26 Some(current)
27 }
28 }
29
30 // Returns a Fibonacci sequence generator
31 fn fibonacci() -> Fibonacci {
32 Fibonacci { curr: 0, next: 1 }
33 }
34
35 fn main() {
36 // `0..3` is an `Iterator` that generates: 0, 1, and 2.
37 let mut sequence = 0..3;
38
39 println!("Four consecutive `next` calls on 0..3");
40 println!("> {:?}", sequence.next());
41 println!("> {:?}", sequence.next());
42 println!("> {:?}", sequence.next());
43 println!("> {:?}", sequence.next());
44
45 // `for` works through an `Iterator` until it returns `None`.
46 // Each `Some` value is unwrapped and bound to a variable (here, `i`).
47 println!("Iterate through 0..3 using `for`");
48 for i in 0..3 {
49 println!("> {}", i);
50 }
51
52 // The `take(n)` method reduces an `Iterator` to its first `n` terms.
53 println!("The first four terms of the Fibonacci sequence are: ");
54 for i in fibonacci().take(4) {
55 println!("> {}", i);
56 }
57
58 // The `skip(n)` method shortens an `Iterator` by dropping its first `n` terms.
59 println!("The next four terms of the Fibonacci sequence are: ");
60 for i in fibonacci().skip(4).take(4) {
61 println!("> {}", i);
62 }
63
64 let array = [1u32, 3, 3, 7]; 306
65
66 // The `iter` method produces an `Iterator` over an array/slice.
67 println!("Iterate the following array {:?}", &array);
68 for i in array.iter() {
69 println!("> {}", i);
70 }
71 }
307
308
impl Trait
impl Trait can be used in two locations:
1. as an argument type
2. as a return type
As an argument type
If your function is generic over a trait but you don't mind the specific type, you can simplify the function
declaration using impl Trait as the type of the argument.
parse_csv_document is generic, allowing it to take any type which implements BufRead, such as
BufReader<File> or [u8] , but it's not important what type R is, and R is only used to declare the type of
src , so the function can also be written as:
Note that using impl Trait as an argument type means that you cannot explicitly state what form of the
function you use, i.e. parse_csv_document::<std::io::Empty>(std::io::empty()) will not work with the
second example.
As a return type 309
If your function returns a type that implements MyTrait , you can write its return type as -> impl MyTrait .
This can help simplify your type signatures quite a lot!
1 use std::iter;
2 use std::vec::IntoIter;
3
4 // This function combines two `Vec<i32>` and returns an iterator over it.
5 // Look how complicated its return type is!
6 fn combine_vecs_explicit_return_type(
7 v: Vec<i32>,
8 u: Vec<i32>,
9 ) -> iter::Cycle<iter::Chain<IntoIter<i32>, IntoIter<i32>>> {
10 v.into_iter().chain(u.into_iter()).cycle()
11 }
12
13 // This is the exact same function, but its return type uses `impl Trait`.
14 // Look how much simpler it is!
15 fn combine_vecs(
16 v: Vec<i32>,
17 u: Vec<i32>,
18 ) -> impl Iterator<Item=i32> {
19 v.into_iter().chain(u.into_iter()).cycle()
20 }
21
22 fn main() {
23 let v1 = vec![1, 2, 3];
24 let v2 = vec![4, 5];
25 let mut v3 = combine_vecs(v1, v2);
26 assert_eq!(Some(1), v3.next());
27 assert_eq!(Some(2), v3.next());
28 assert_eq!(Some(3), v3.next());
29 assert_eq!(Some(4), v3.next());
30 assert_eq!(Some(5), v3.next());
31 println!("all done");
32 }
More importantly, some Rust types can't be written out. For example, every closure has its own unnamed
concrete type. Before impl Trait syntax, you had to allocate on the heap in order to return a closure. But
now you can do it all statically, like this:
You can also use impl Trait to return an iterator that uses map or filter closures! This makes using
map and filter easier. Because closure types don't have names, you can't write out an explicit return type
if your function returns iterators with closures. But with impl Trait you can do this easily:
1 fn double_positives<'a>(numbers: &'a Vec<i32>) -> impl Iterator<Item = i32> + 'a { 310
2 numbers
3 .iter()
4 .filter(|x| x > &&0)
5 .map(|x| x * 2)
6 }
7
8 fn main() {
9 let singles = vec![-3, -2, 2, 3];
10 let doubles = double_positives(&singles);
11 assert_eq!(doubles.collect::<Vec<i32>>(), vec![4, 6]);
12 }
311
312
Clone
When dealing with resources, the default behavior is to transfer them during assignments or function calls.
However, sometimes we need to make a copy of the resource as well.
The Clone trait helps us do exactly this. Most commonly, we can use the .clone() method defined by
the Clone trait.
Supertraits
Rust doesn't have "inheritance", but you can define a trait as being a superset of another trait. For example:
1 trait Person {
2 fn name(&self) -> String;
3 }
4
5 // Person is a supertrait of Student.
6 // Implementing Student requires you to also impl Person.
7 trait Student: Person {
8 fn university(&self) -> String;
9 }
10
11 trait Programmer {
12 fn fav_language(&self) -> String;
13 }
14
15 // CompSciStudent (computer science student) is a subtrait of both Programmer
16 // and Student. Implementing CompSciStudent requires you to impl both supertraits.
17 trait CompSciStudent: Programmer + Student {
18 fn git_username(&self) -> String;
19 }
20
21 fn comp_sci_student_greeting(student: &dyn CompSciStudent) -> String {
22 format!(
23 "My name is {} and I attend {}. My favorite language is {}. My Git username is
24 student.name(),
25 student.university(),
26 student.fav_language(),
27 student.git_username()
28 )
29 }
30
31 fn main() {}
See also:
Good news: because each trait implementation gets its own impl block, it's clear which trait's get
method you're implementing.
What about when it comes time to call those methods? To disambiguate between them, we have to use
Fully Qualified Syntax.
1 trait UsernameWidget {
2 // Get the selected username out of this widget
3 fn get(&self) -> String;
4 }
5
6 trait AgeWidget {
7 // Get the selected age out of this widget
8 fn get(&self) -> u8;
9 }
10
11 // A form with both a UsernameWidget and an AgeWidget
12 struct Form {
13 username: String,
14 age: u8,
15 }
16
17 impl UsernameWidget for Form {
18 fn get(&self) -> String {
19 self.username.clone()
20 }
21 }
22
23 impl AgeWidget for Form {
24 fn get(&self) -> u8 {
25 self.age
26 }
27 }
28
29 fn main() {
30 let form = Form {
31 username: "rustacean".to_owned(),
32 age: 28,
33 };
34
35 // If you uncomment this line, you'll get an error saying
36 // "multiple `get` found". Because, after all, there are multiple methods
37 // named `get`.
38 // println!("{}", form.get());
39
40 let username = <Form as UsernameWidget>::get(&form);
41 assert_eq!("rustacean".to_owned(), username);
42 let age = <Form as AgeWidget>::get(&form);
43 assert_eq!(28, age);
44 }
See also: 317
macro_rules!
Rust provides a powerful macro system that allows metaprogramming. As you've seen in previous
chapters, macros look like functions, except that their name ends with a bang ! , but instead of generating
a function call, macros are expanded into source code that gets compiled with the rest of the program.
However, unlike macros in C and other languages, Rust macros are expanded into abstract syntax trees,
rather than string preprocessing, so you don't get unexpected precedence bugs.
1. Don't repeat yourself. There are many cases where you may need similar functionality in multiple
places but with different types. Often, writing a macro is a useful way to avoid repeating code. (More
on this later)
2. Domain-specific languages. Macros allow you to define special syntax for a specific purpose. (More
on this later)
3. Variadic interfaces. Sometimes you want to define an interface that takes a variable number of
arguments. An example is println! which could take any number of arguments, depending on the
format string. (More on this later)
320
321
Syntax
In following subsections, we will show how to define macros in Rust. There are three basic ideas:
Designators
The arguments of a macro are prefixed by a dollar sign $ and type annotated with a designator:
1 macro_rules! create_function {
2 // This macro takes an argument of designator `ident` and
3 // creates a function named `$func_name`.
4 // The `ident` designator is used for variable/function names.
5 ($func_name:ident) => {
6 fn $func_name() {
7 // The `stringify!` macro converts an `ident` into a string.
8 println!("You called {:?}()",
9 stringify!($func_name));
10 }
11 };
12 }
13
14 // Create functions named `foo` and `bar` with the above macro.
15 create_function!(foo);
16 create_function!(bar);
17
18 macro_rules! print_result {
19 // This macro takes an expression of type `expr` and prints
20 // it as a string along with its result.
21 // The `expr` designator is used for expressions.
22 ($expression:expr) => {
23 // `stringify!` will convert the expression *as it is* into a string.
24 println!("{:?} = {:?}",
25 stringify!($expression),
26 $expression);
27 };
28 }
29
30 fn main() {
31 foo();
32 bar();
33
34 print_result!(1u32 + 1);
35
36 // Recall that blocks are expressions too!
37 print_result!({
38 let x = 1u32;
39
40 x * x + 2 * x - 1
41 });
42 }
block
expr is used for expressions
ident is used for variable/function names
item
literal is used for literal constants
pat (pattern)
path
stmt (statement)
tt (token tree) 324
ty (type)
vis (visibility qualifier)
Overload
Macros can be overloaded to accept different combinations of arguments. In that regard, macro_rules! can
work similarly to a match block:
Repeat
Macros can use + in the argument list to indicate that an argument may repeat at least once, or * , to
indicate that the argument may repeat zero or more times.
In the following example, surrounding the matcher with $(...),+ will match one or more expression,
separated by commas. Also note that the semicolon is optional on the last case.
Suppose that I want to define a little calculator API. I would like to supply an expression and have the
output printed to console.
1 macro_rules! calculate {
2 (eval $e:expr) => {
3 {
4 let val: usize = $e; // Force types to be unsigned integers
5 println!("{} = {}", stringify!{$e}, val);
6 }
7 };
8 }
9
10 fn main() {
11 calculate! {
12 eval 1 + 2 // hehehe `eval` is _not_ a Rust keyword!
13 }
14
15 calculate! {
16 eval (1 + 2) * (3 / 4)
17 }
18 }
Output:
1 + 2 = 3
(1 + 2) * (3 / 4) = 0
This was a very simple example, but much more complex interfaces have been developed, such as
lazy_static or clap .
Also, note the two pairs of braces in the macro. The outer ones are part of the syntax of macro_rules! , in
addition to () or [] .
335
336
Variadic Interfaces
A variadic interface takes an arbitrary number of arguments. For example, println! can take an arbitrary
number of arguments, as determined by the format string.
We can extend our calculate! macro from the previous section to be variadic:
1 macro_rules! calculate {
2 // The pattern for a single `eval`
3 (eval $e:expr) => {
4 {
5 let val: usize = $e; // Force types to be integers
6 println!("{} = {}", stringify!{$e}, val);
7 }
8 };
9
10 // Decompose multiple `eval`s recursively
11 (eval $e:expr, $(eval $es:expr),+) => {{
12 calculate! { eval $e }
13 calculate! { $(eval $es),+ }
14 }};
15 }
16
17 fn main() {
18 calculate! { // Look ma! Variadic `calculate!`!
19 eval 1 + 2,
20 eval 3 + 4,
21 eval (2 * 3) + 1
22 }
23 }
Output:
1 + 2 = 3
3 + 4 = 7
(2 * 3) + 1 = 7
337
338
Error handling
Error handling is the process of handling the possibility of failure. For example, failing to read a file and
then continuing to use that bad input would clearly be problematic. Noticing and explicitly managing those
errors saves the rest of the program from various pitfalls.
There are various ways to deal with errors in Rust, which are described in the following subchapters. They
all have more or less subtle differences and different use cases. As a rule of thumb:
An explicit panic is mainly useful for tests and dealing with unrecoverable errors. For prototyping it can be
useful, for example when dealing with functions that haven't been implemented yet, but in those cases the
more descriptive unimplemented is better. In tests panic is a reasonable way to explicitly fail.
The Option type is for when a value is optional or when the lack of a value is not an error condition. For
example the parent of a directory - / and C: don't have one. When dealing with Option s, unwrap is fine
for prototyping and cases where it's absolutely certain that there is guaranteed to be a value. However
expect is more useful since it lets you specify an error message in case something goes wrong anyway.
When there is a chance that things do go wrong and the caller has to deal with the problem, use Result .
You can unwrap and expect them as well (please don't do that unless it's a test or quick prototype).
For a more rigorous discussion of error handling, refer to the error handling section in the official book.
339
340
panic
The simplest error handling mechanism we will see is panic . It prints an error message, starts unwinding
the stack, and usually exits the program. Here, we explicitly call panic on our error condition:
1 fn drink(beverage: &str) {
2 // You shouldn't drink too much sugary beverages.
3 if beverage == "lemonade" { panic!("AAAaaaaa!!!!"); }
4
5 println!("Some refreshing {} is all I need.", beverage);
6 }
7
8 fn main() {
9 drink("water");
10 drink("lemonade");
11 drink("still water");
12 }
The first call to drink works. The second panics and thus the third is never called.
341
342
Building on the prior lemonade example, we explicitly use the panic strategy to exercise different lines of
code.
1 fn drink(beverage: &str) {
2 // You shouldn't drink too much sugary beverages.
3 if beverage == "lemonade" {
4 if cfg!(panic = "abort") {
5 println!("This is not your party. Run!!!!");
6 } else {
7 println!("Spit it out!!!!");
8 }
9 } else {
10 println!("Some refreshing {} is all I need.", beverage);
11 }
12 }
13
14 fn main() {
15 drink("water");
16 drink("lemonade");
17 }
Here is another example focusing on rewriting drink() and explicitly use the unwind keyword.
1 #[cfg(panic = "unwind")]
2 fn ah() {
3 println!("Spit it out!!!!");
4 }
5
6 #[cfg(not(panic = "unwind"))]
7 fn ah() {
8 println!("This is not your party. Run!!!!");
9 }
10
11 fn drink(beverage: &str) {
12 if beverage == "lemonade" {
13 ah();
14 } else {
15 println!("Some refreshing {} is all I need.", beverage);
16 }
17 }
18
19 fn main() {
20 drink("water");
21 drink("lemonade");
22 }
The panic strategy can be set from the command line by using abort or unwind .
We could test this against the null string ( "" ) as we do with a lemonade. Since we're using Rust, let's
instead have the compiler point out cases where there's no drink.
An enum called Option<T> in the std library is used when absence is a possibility. It manifests itself as
one of two "options":
These cases can either be explicitly handled via match or implicitly with unwrap . Implicit handling will
either return the inner element or panic .
Note that it's possible to manually customize panic with expect, but unwrap otherwise leaves us with a
less meaningful output than explicit handling. In the following example, explicit handling yields a more
controlled result while retaining the option to panic if desired.
1 // The adult has seen it all, and can handle any drink well. 345
2 // All drinks are handled explicitly using `match`.
3 fn give_adult(drink: Option<&str>) {
4 // Specify a course of action for each case.
5 match drink {
6 Some("lemonade") => println!("Yuck! Too sugary."),
7 Some(inner) => println!("{}? How nice.", inner),
8 None => println!("No drink? Oh well."),
9 }
10 }
11
12 // Others will `panic` before drinking sugary drinks.
13 // All drinks are handled implicitly using `unwrap`.
14 fn drink(drink: Option<&str>) {
15 // `unwrap` returns a `panic` when it receives a `None`.
16 let inside = drink.unwrap();
17 if inside == "lemonade" { panic!("AAAaaaaa!!!!"); }
18
19 println!("I love {}s!!!!!", inside);
20 }
21
22 fn main() {
23 let water = Some("water");
24 let lemonade = Some("lemonade");
25 let void = None;
26
27 give_adult(water);
28 give_adult(lemonade);
29 give_adult(void);
30
31 let coffee = Some("coffee");
32 let nothing = None;
33
34 drink(coffee);
35 drink(nothing);
36 }
346
347
You can chain many ? s together to make your code much more readable.
1 struct Person {
2 job: Option<Job>,
3 }
4
5 #[derive(Clone, Copy)]
6 struct Job {
7 phone_number: Option<PhoneNumber>,
8 }
9
10 #[derive(Clone, Copy)]
11 struct PhoneNumber {
12 area_code: Option<u8>,
13 number: u32,
14 }
15
16 impl Person {
17
18 // Gets the area code of the phone number of the person's job, if it exists.
19 fn work_phone_area_code(&self) -> Option<u8> {
20 // This would need many nested `match` statements without the `?` operator.
21 // It would take a lot more code - try writing it yourself and see which
22 // is easier.
23 self.job?.phone_number?.area_code
24 }
25 }
26
27 fn main() {
28 let p = Person {
29 job: Some(Job {
30 phone_number: Some(PhoneNumber {
31 area_code: Some(61),
32 number: 439222222,
33 }),
34 }),
35 };
36
37 assert_eq!(p.work_phone_area_code(), Some(61));
38 }
348
349
Combinators: map
match is a valid method for handling Option s. However, you may eventually find heavy usage tedious,
especially with operations only valid with an input. In these cases, combinators can be used to manage
control flow in a modular fashion.
Option has a built in method called map() , a combinator for the simple mapping of Some -> Some and
None -> None . Multiple map() calls can be chained together for even more flexibility.
In the following example, process() replaces all functions previous to it while staying compact.
1 #![allow(dead_code)] 350
2
3 #[derive(Debug)] enum Food { Apple, Carrot, Potato }
4
5 #[derive(Debug)] struct Peeled(Food);
6 #[derive(Debug)] struct Chopped(Food);
7 #[derive(Debug)] struct Cooked(Food);
8
9 // Peeling food. If there isn't any, then return `None`.
10 // Otherwise, return the peeled food.
11 fn peel(food: Option<Food>) -> Option<Peeled> {
12 match food {
13 Some(food) => Some(Peeled(food)),
14 None => None,
15 }
16 }
17
18 // Chopping food. If there isn't any, then return `None`.
19 // Otherwise, return the chopped food.
20 fn chop(peeled: Option<Peeled>) -> Option<Chopped> {
21 match peeled {
22 Some(Peeled(food)) => Some(Chopped(food)),
23 None => None,
24 }
25 }
26
27 // Cooking food. Here, we showcase `map()` instead of `match` for case handling.
28 fn cook(chopped: Option<Chopped>) -> Option<Cooked> {
29 chopped.map(|Chopped(food)| Cooked(food))
30 }
31
32 // A function to peel, chop, and cook food all in sequence.
33 // We chain multiple uses of `map()` to simplify the code.
34 fn process(food: Option<Food>) -> Option<Cooked> {
35 food.map(|f| Peeled(f))
36 .map(|Peeled(f)| Chopped(f))
37 .map(|Chopped(f)| Cooked(f))
38 }
39
40 // Check whether there's food or not before trying to eat it!
41 fn eat(food: Option<Cooked>) {
42 match food {
43 Some(food) => println!("Mmm. I love {:?}", food),
44 None => println!("Oh no! It wasn't edible."),
45 }
46 }
47
48 fn main() {
49 let apple = Some(Food::Apple);
50 let carrot = Some(Food::Carrot);
51 let potato = None;
52
53 let cooked_apple = cook(chop(peel(apple)));
54 let cooked_carrot = cook(chop(peel(carrot)));
55 // Let's try the simpler looking `process()` now.
56 let cooked_potato = process(potato);
57
58 eat(cooked_apple);
59 eat(cooked_carrot);
60 eat(cooked_potato);
61 }
See also: 351
Combinators: and_then
map() was described as a chainable way to simplify match statements. However, using map() on a
function that returns an Option<T> results in the nested Option<Option<T>> . Chaining multiple calls
together can then become confusing. That's where another combinator called and_then() , known in some
languages as flatmap, comes in.
and_then() calls its function input with the wrapped value and returns the result. If the Option is None ,
then it returns None instead.
See also:
1 #[derive(Debug)]
2 enum Fruit { Apple, Orange, Banana, Kiwi, Lemon }
3
4 fn main() {
5 let apple = Some(Fruit::Apple);
6 let orange = Some(Fruit::Orange);
7 let no_fruit: Option<Fruit> = None;
8
9 let first_available_fruit = no_fruit.or(orange).or(apple);
10 println!("first_available_fruit: {:?}", first_available_fruit);
11 // first_available_fruit: Some(Orange)
12
13 // `or` moves its argument.
14 // In the example above, `or(orange)` returned a `Some`, so `or(apple)` was not invo
15 // But the variable named `apple` has been moved regardless, and cannot be used anym
16 // println!("Variable apple was moved, so this line won't compile: {:?}", apple);
17 // TODO: uncomment the line above to see the compiler error
18 }
1 #[derive(Debug)]
2 enum Fruit { Apple, Orange, Banana, Kiwi, Lemon }
3
4 fn main() {
5 let mut my_fruit: Option<Fruit> = None;
6 let apple = Fruit::Apple;
7 let first_available_fruit = my_fruit.get_or_insert(apple);
8 println!("first_available_fruit is: {:?}", first_available_fruit);
9 println!("my_fruit is: {:?}", my_fruit);
10 // first_available_fruit is: Apple
11 // my_fruit is: Some(Apple)
12 //println!("Variable named `apple` is moved: {:?}", apple);
13 // TODO: uncomment the line above to see the compiler error
14 }
See also:
Result
Result is a richer version of the Option type that describes possible error instead of possible absence.
Like Option , Result has many methods associated with it. unwrap() , for example, either yields the
element T or panic s. For case handling, there are many combinators between Result and Option that
overlap.
In working with Rust, you will likely encounter methods that return the Result type, such as the parse()
method. It might not always be possible to parse a string into the other type, so parse() returns a Result
indicating possible failure.
Let's see what happens when we successfully and unsuccessfully parse() a string:
In the unsuccessful case, parse() leaves us with an error for unwrap() to panic on. Additionally, the
panic exits our program and provides an unpleasant error message.
To improve the quality of our error message, we should be more specific about the return type and consider
explicitly handling the error.
fn main() {
println!("Hello World!");
}
However main is also able to have a return type of Result . If an error occurs within the main function it 361
will return an error code and print a debug representation of the error (using the Debug trait). The following
example shows such a scenario and touches on aspects covered in the following section.
1 use std::num::ParseIntError;
2
3 fn main() -> Result<(), ParseIntError> {
4 let number_str = "10";
5 let number = match number_str.parse::<i32>() {
6 Ok(number) => number,
7 Err(e) => return Err(e),
8 };
9 println!("{}", number);
10 Ok(())
11 }
362
363
We first need to know what kind of error type we are dealing with. To determine the Err type, we look to
parse() , which is implemented with the FromStr trait for i32 . As a result, the Err type is specified as
ParseIntError .
In the example below, the straightforward match statement leads to code that is overall more
cumbersome.
1 use std::num::ParseIntError;
2
3 // With the return type rewritten, we use pattern matching without `unwrap()`.
4 fn multiply(first_number_str: &str, second_number_str: &str) -> Result<i32, ParseIntErro
5 match first_number_str.parse::<i32>() {
6 Ok(first_number) => {
7 match second_number_str.parse::<i32>() {
8 Ok(second_number) => {
9 Ok(first_number * second_number)
10 },
11 Err(e) => Err(e),
12 }
13 },
14 Err(e) => Err(e),
15 }
16 }
17
18 fn print(result: Result<i32, ParseIntError>) {
19 match result {
20 Ok(n) => println!("n is {}", n),
21 Err(e) => println!("Error: {}", e),
22 }
23 }
24
25 fn main() {
26 // This still presents a reasonable answer.
27 let twenty = multiply("10", "2");
28 print(twenty);
29
30 // The following now provides a much more helpful error message.
31 let tt = multiply("t", "2");
32 print(tt);
33 }
Luckily, Option 's map , and_then , and many other combinators are also implemented for Result .
Result contains a complete listing.
1 use std::num::ParseIntError; 364
2
3 // As with `Option`, we can use combinators such as `map()`.
4 // This function is otherwise identical to the one above and reads:
5 // Multiply if both values can be parsed from str, otherwise pass on the error.
6 fn multiply(first_number_str: &str, second_number_str: &str) -> Result<i32, ParseIntErro
7 first_number_str.parse::<i32>().and_then(|first_number| {
8 second_number_str.parse::<i32>().map(|second_number| first_number * second_numbe
9 })
10 }
11
12 fn print(result: Result<i32, ParseIntError>) {
13 match result {
14 Ok(n) => println!("n is {}", n),
15 Err(e) => println!("Error: {}", e),
16 }
17 }
18
19 fn main() {
20 // This still presents a reasonable answer.
21 let twenty = multiply("10", "2");
22 print(twenty);
23
24 // The following now provides a much more helpful error message.
25 let tt = multiply("t", "2");
26 print(tt);
27 }
365
366
At a module level, creating aliases can be particularly helpful. Errors found in a specific module often have
the same Err type, so a single alias can succinctly define all associated Results . This is so useful that
the std library even supplies one: io::Result !
1 use std::num::ParseIntError;
2
3 // Define a generic alias for a `Result` with the error type `ParseIntError`.
4 type AliasedResult<T> = Result<T, ParseIntError>;
5
6 // Use the above alias to refer to our specific `Result` type.
7 fn multiply(first_number_str: &str, second_number_str: &str) -> AliasedResult<i32> {
8 first_number_str.parse::<i32>().and_then(|first_number| {
9 second_number_str.parse::<i32>().map(|second_number| first_number * second_numbe
10 })
11 }
12
13 // Here, the alias again allows us to save some space.
14 fn print(result: AliasedResult<i32>) {
15 match result {
16 Ok(n) => println!("n is {}", n),
17 Err(e) => println!("Error: {}", e),
18 }
19 }
20
21 fn main() {
22 print(multiply("10", "2"));
23 print(multiply("t", "2"));
24 }
See also:
io::Result
367
368
Early returns
In the previous example, we explicitly handled the errors using combinators. Another way to deal with this
case analysis is to use a combination of match statements and early returns.
That is, we can simply stop executing the function and return the error if one occurs. For some, this form of
code can be easier to both read and write. Consider this version of the previous example, rewritten using
early returns:
1 use std::num::ParseIntError;
2
3 fn multiply(first_number_str: &str, second_number_str: &str) -> Result<i32, ParseIntErro
4 let first_number = match first_number_str.parse::<i32>() {
5 Ok(first_number) => first_number,
6 Err(e) => return Err(e),
7 };
8
9 let second_number = match second_number_str.parse::<i32>() {
10 Ok(second_number) => second_number,
11 Err(e) => return Err(e),
12 };
13
14 Ok(first_number * second_number)
15 }
16
17 fn print(result: Result<i32, ParseIntError>) {
18 match result {
19 Ok(n) => println!("n is {}", n),
20 Err(e) => println!("Error: {}", e),
21 }
22 }
23
24 fn main() {
25 print(multiply("10", "2"));
26 print(multiply("t", "2"));
27 }
At this point, we've learned to explicitly handle errors using combinators and early returns. While we
generally want to avoid panicking, explicitly handling all of our errors is cumbersome.
In the next section, we'll introduce ? for the cases where we simply need to unwrap without possibly
inducing panic .
369
370
Introducing ?
Sometimes we just want the simplicity of unwrap without the possibility of a panic . Until now, unwrap
has forced us to nest deeper and deeper when what we really wanted was to get the variable out. This is
exactly the purpose of ? .
? is almost1 exactly equivalent to an unwrap which return s instead of panic king on Err s. Let's see
how we can simplify the earlier example that used combinators:
1 use std::num::ParseIntError;
2
3 fn multiply(first_number_str: &str, second_number_str: &str) -> Result<i32, ParseIntErro
4 let first_number = first_number_str.parse::<i32>()?;
5 let second_number = second_number_str.parse::<i32>()?;
6
7 Ok(first_number * second_number)
8 }
9
10 fn print(result: Result<i32, ParseIntError>) {
11 match result {
12 Ok(n) => println!("n is {}", n),
13 Err(e) => println!("Error: {}", e),
14 }
15 }
16
17 fn main() {
18 print(multiply("10", "2"));
19 print(multiply("t", "2"));
20 }
Sometimes an Option needs to interact with a Result , or a Result<T, Error1> needs to interact with a
Result<T, Error2> . In those cases, we want to manage our different error types in a way that makes them
composable and easy to interact with.
In the following code, two instances of unwrap generate different error types. Vec::first returns an
Option , while parse::<i32> returns a Result<i32, ParseIntError> :
Over the next sections, we'll see several strategies for handling these kind of problems.
374
375
1 use std::num::ParseIntError;
2
3 fn double_first(vec: Vec<&str>) -> Option<Result<i32, ParseIntError>> {
4 vec.first().map(|first| {
5 first.parse::<i32>().map(|n| 2 * n)
6 })
7 }
8
9 fn main() {
10 let numbers = vec!["42", "93", "18"];
11 let empty = vec![];
12 let strings = vec!["tofu", "93", "18"];
13
14 println!("The first doubled is {:?}", double_first(numbers));
15
16 println!("The first doubled is {:?}", double_first(empty));
17 // Error 1: the input vector is empty
18
19 println!("The first doubled is {:?}", double_first(strings));
20 // Error 2: the element doesn't parse to a number
21 }
There are times when we'll want to stop processing on errors (like with ? ) but keep going when the
Option is None . The transpose function comes in handy to swap the Result and Option .
1 use std::num::ParseIntError;
2
3 fn double_first(vec: Vec<&str>) -> Result<Option<i32>, ParseIntError> {
4 let opt = vec.first().map(|first| {
5 first.parse::<i32>().map(|n| 2 * n)
6 });
7
8 opt.transpose()
9 }
10
11 fn main() {
12 let numbers = vec!["42", "93", "18"];
13 let empty = vec![];
14 let strings = vec!["tofu", "93", "18"];
15
16 println!("The first doubled is {:?}", double_first(numbers));
17 println!("The first doubled is {:?}", double_first(empty));
18 println!("The first doubled is {:?}", double_first(strings));
19 }
376
377
Rust allows us to define our own error types. In general, a "good" error type:
Boxing errors
A way to write simple code while preserving the original errors is to Box them. The drawback is that the
underlying error type is only known at runtime and not statically determined.
The stdlib helps in boxing our errors by having Box implement conversion from any type that implements
the Error trait into the trait object Box<Error> , via From .
1 use std::error;
2 use std::fmt;
3
4 // Change the alias to use `Box<dyn error::Error>`.
5 type Result<T> = std::result::Result<T, Box<dyn error::Error>>;
6
7 #[derive(Debug, Clone)]
8 struct EmptyVec;
9
10 impl fmt::Display for EmptyVec {
11 fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
12 write!(f, "invalid first item to double")
13 }
14 }
15
16 impl error::Error for EmptyVec {}
17
18 fn double_first(vec: Vec<&str>) -> Result<i32> {
19 vec.first()
20 .ok_or_else(|| EmptyVec.into()) // Converts to Box
21 .and_then(|s| {
22 s.parse::<i32>()
23 .map_err(|e| e.into()) // Converts to Box
24 .map(|i| 2 * i)
25 })
26 }
27
28 fn print(result: Result<i32>) {
29 match result {
30 Ok(n) => println!("The first doubled is {}", n),
31 Err(e) => println!("Error: {}", e),
32 }
33 }
34
35 fn main() {
36 let numbers = vec!["42", "93", "18"];
37 let empty = vec![];
38 let strings = vec!["tofu", "93", "18"];
39
40 print(double_first(numbers));
41 print(double_first(empty));
42 print(double_first(strings));
43 }
See also:
Other uses of ?
Notice in the previous example that our immediate reaction to calling parse is to map the error from a
library error into a boxed error:
.and_then(|s| s.parse::<i32>())
.map_err(|e| e.into())
Since this is a simple and common operation, it would be convenient if it could be elided. Alas, because
and_then is not sufficiently flexible, it cannot. However, we can instead use ? .
? was previously explained as either unwrap or return Err(err) . This is only mostly true. It actually
means unwrap or return Err(From::from(err)) . Since From::from is a conversion utility between
different types, this means that if you ? where the error is convertible to the return type, it will convert
automatically.
Here, we rewrite the previous example using ? . As a result, the map_err will go away when From::from
is implemented for our error type:
1 use std::error; 383
2 use std::fmt;
3
4 // Change the alias to use `Box<dyn error::Error>`.
5 type Result<T> = std::result::Result<T, Box<dyn error::Error>>;
6
7 #[derive(Debug)]
8 struct EmptyVec;
9
10 impl fmt::Display for EmptyVec {
11 fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
12 write!(f, "invalid first item to double")
13 }
14 }
15
16 impl error::Error for EmptyVec {}
17
18 // The same structure as before but rather than chain all `Results`
19 // and `Options` along, we `?` to get the inner value out immediately.
20 fn double_first(vec: Vec<&str>) -> Result<i32> {
21 let first = vec.first().ok_or(EmptyVec)?;
22 let parsed = first.parse::<i32>()?;
23 Ok(2 * parsed)
24 }
25
26 fn print(result: Result<i32>) {
27 match result {
28 Ok(n) => println!("The first doubled is {}", n),
29 Err(e) => println!("Error: {}", e),
30 }
31 }
32
33 fn main() {
34 let numbers = vec!["42", "93", "18"];
35 let empty = vec![];
36 let strings = vec!["tofu", "93", "18"];
37
38 print(double_first(numbers));
39 print(double_first(empty));
40 print(double_first(strings));
41 }
This is actually fairly clean now. Compared with the original panic , it is very similar to replacing the
unwrap calls with ? except that the return types are Result . As a result, they must be destructured at the
top level.
See also:
From::from and ?
384
385
Wrapping errors
An alternative to boxing errors is to wrap them in your own error type.
1 use std::error; 386
2 use std::error::Error;
3 use std::num::ParseIntError;
4 use std::fmt;
5
6 type Result<T> = std::result::Result<T, DoubleError>;
7
8 #[derive(Debug)]
9 enum DoubleError {
10 EmptyVec,
11 // We will defer to the parse error implementation for their error.
12 // Supplying extra info requires adding more data to the type.
13 Parse(ParseIntError),
14 }
15
16 impl fmt::Display for DoubleError {
17 fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
18 match *self {
19 DoubleError::EmptyVec =>
20 write!(f, "please use a vector with at least one element"),
21 // The wrapped error contains additional information and is available
22 // via the source() method.
23 DoubleError::Parse(..) =>
24 write!(f, "the provided string could not be parsed as int"),
25 }
26 }
27 }
28
29 impl error::Error for DoubleError {
30 fn source(&self) -> Option<&(dyn error::Error + 'static)> {
31 match *self {
32 DoubleError::EmptyVec => None,
33 // The cause is the underlying implementation error type. Is implicitly
34 // cast to the trait object `&error::Error`. This works because the
35 // underlying type already implements the `Error` trait.
36 DoubleError::Parse(ref e) => Some(e),
37 }
38 }
39 }
40
41 // Implement the conversion from `ParseIntError` to `DoubleError`.
42 // This will be automatically called by `?` if a `ParseIntError`
43 // needs to be converted into a `DoubleError`.
44 impl From<ParseIntError> for DoubleError {
45 fn from(err: ParseIntError) -> DoubleError {
46 DoubleError::Parse(err)
47 }
48 }
49
50 fn double_first(vec: Vec<&str>) -> Result<i32> {
51 let first = vec.first().ok_or(DoubleError::EmptyVec)?;
52 // Here we implicitly use the `ParseIntError` implementation of `From` (which
53 // we defined above) in order to create a `DoubleError`.
54 let parsed = first.parse::<i32>()?;
55
56 Ok(2 * parsed)
57 }
58
59 fn print(result: Result<i32>) {
60 match result {
61 Ok(n) => println!("The first doubled is {}", n),
62 Err(e) => {
63 println!("Error: {}", e);
64 if let Some(source) = e.source() { 387
65 println!(" Caused by: {}", source);
66 }
67 },
68 }
69 }
70
71 fn main() {
72 let numbers = vec!["42", "93", "18"];
73 let empty = vec![];
74 let strings = vec!["tofu", "93", "18"];
75
76 print(double_first(numbers));
77 print(double_first(empty));
78 print(double_first(strings));
79 }
This adds a bit more boilerplate for handling errors and might not be needed in all applications. There are
some libraries that can take care of the boilerplate for you.
See also:
1 fn main() {
2 let strings = vec!["tofu", "93", "18"];
3 let numbers: Vec<_> = strings
4 .into_iter()
5 .map(|s| s.parse::<i32>())
6 .collect();
7 println!("Results: {:?}", numbers);
8 }
1 fn main() {
2 let strings = vec!["tofu", "93", "18"];
3 let numbers: Vec<_> = strings
4 .into_iter()
5 .filter_map(|s| s.parse::<i32>().ok())
6 .collect();
7 println!("Results: {:?}", numbers);
8 }
1 fn main() {
2 let strings = vec!["42", "tofu", "93", "999", "18"];
3 let mut errors = vec![];
4 let numbers: Vec<_> = strings
5 .into_iter()
6 .map(|s| s.parse::<u8>())
7 .filter_map(|r| r.map_err(|e| errors.push(e)).ok())
8 .collect();
9 println!("Numbers: {:?}", numbers);
10 println!("Errors: {:?}", errors);
11 }
Fail the entire operation with collect() 390
Result implements FromIterator so that a vector of results ( Vec<Result<T, E>> ) can be turned into a
result with a vector ( Result<Vec<T>, E> ). Once an Result::Err is found, the iteration will terminate.
1 fn main() {
2 let strings = vec!["tofu", "93", "18"];
3 let numbers: Result<Vec<_>, _> = strings
4 .into_iter()
5 .map(|s| s.parse::<i32>())
6 .collect();
7 println!("Results: {:?}", numbers);
8 }
When you look at the results, you'll note that everything is still wrapped in Result . A little more
boilerplate is needed for this.
1 fn main() {
2 let strings = vec!["tofu", "93", "18"];
3 let (numbers, errors): (Vec<_>, Vec<_>) = strings
4 .into_iter()
5 .map(|s| s.parse::<i32>())
6 .partition(Result::is_ok);
7 let numbers: Vec<_> = numbers.into_iter().map(Result::unwrap).collect();
8 let errors: Vec<_> = errors.into_iter().map(Result::unwrap_err).collect();
9 println!("Numbers: {:?}", numbers);
10 println!("Errors: {:?}", errors);
11 }
391
392
See also:
Boxed values can be dereferenced using the * operator; this removes one layer of indirection.
1 use std::mem; 395
2
3 #[allow(dead_code)]
4 #[derive(Debug, Clone, Copy)]
5 struct Point {
6 x: f64,
7 y: f64,
8 }
9
10 // A Rectangle can be specified by where its top left and bottom right
11 // corners are in space
12 #[allow(dead_code)]
13 struct Rectangle {
14 top_left: Point,
15 bottom_right: Point,
16 }
17
18 fn origin() -> Point {
19 Point { x: 0.0, y: 0.0 }
20 }
21
22 fn boxed_origin() -> Box<Point> {
23 // Allocate this point on the heap, and return a pointer to it
24 Box::new(Point { x: 0.0, y: 0.0 })
25 }
26
27 fn main() {
28 // (all the type annotations are superfluous)
29 // Stack allocated variables
30 let point: Point = origin();
31 let rectangle: Rectangle = Rectangle {
32 top_left: origin(),
33 bottom_right: Point { x: 3.0, y: -4.0 }
34 };
35
36 // Heap allocated rectangle
37 let boxed_rectangle: Box<Rectangle> = Box::new(Rectangle {
38 top_left: origin(),
39 bottom_right: Point { x: 3.0, y: -4.0 },
40 });
41
42 // The output of functions can be boxed
43 let boxed_point: Box<Point> = Box::new(origin());
44
45 // Double indirection
46 let box_in_a_box: Box<Box<Point>> = Box::new(boxed_origin());
47
48 println!("Point occupies {} bytes on the stack",
49 mem::size_of_val(&point));
50 println!("Rectangle occupies {} bytes on the stack",
51 mem::size_of_val(&rectangle));
52
53 // box size == pointer size
54 println!("Boxed point occupies {} bytes on the stack",
55 mem::size_of_val(&boxed_point));
56 println!("Boxed rectangle occupies {} bytes on the stack",
57 mem::size_of_val(&boxed_rectangle));
58 println!("Boxed box occupies {} bytes on the stack",
59 mem::size_of_val(&box_in_a_box));
60
61 // Copy the data contained in `boxed_point` into `unboxed_point`
62 let unboxed_point: Point = *boxed_point;
63 println!("Unboxed point occupies {} bytes on the stack",
64 mem::size_of_val(&unboxed_point)); 396
65 }
397
398
Vectors
Vectors are re-sizable arrays. Like slices, their size is not known at compile time, but they can grow or
shrink at any time. A vector is represented using 3 parameters:
The capacity indicates how much memory is reserved for the vector. The vector can grow as long as the
length is smaller than the capacity. When this threshold needs to be surpassed, the vector is reallocated
with a larger capacity.
1 fn main() { 399
2 // Iterators can be collected into vectors
3 let collected_iterator: Vec<i32> = (0..10).collect();
4 println!("Collected (0..10) into: {:?}", collected_iterator);
5
6 // The `vec!` macro can be used to initialize a vector
7 let mut xs = vec![1i32, 2, 3];
8 println!("Initial vector: {:?}", xs);
9
10 // Insert new element at the end of the vector
11 println!("Push 4 into the vector");
12 xs.push(4);
13 println!("Vector: {:?}", xs);
14
15 // Error! Immutable vectors can't grow
16 collected_iterator.push(0);
17 // FIXME ^ Comment out this line
18
19 // The `len` method yields the number of elements currently stored in a vector
20 println!("Vector length: {}", xs.len());
21
22 // Indexing is done using the square brackets (indexing starts at 0)
23 println!("Second element: {}", xs[1]);
24
25 // `pop` removes the last element from the vector and returns it
26 println!("Pop last element: {:?}", xs.pop());
27
28 // Out of bounds indexing yields a panic
29 println!("Fourth element: {}", xs[3]);
30 // FIXME ^ Comment out this line
31
32 // `Vector`s can be easily iterated over
33 println!("Contents of xs:");
34 for x in xs.iter() {
35 println!("> {}", x);
36 }
37
38 // A `Vector` can also be iterated over while the iteration
39 // count is enumerated in a separate variable (`i`)
40 for (i, x) in xs.iter().enumerate() {
41 println!("In position {} we have value {}", i, x);
42 }
43
44 // Thanks to `iter_mut`, mutable `Vector`s can also be iterated
45 // over in a way that allows modifying each value
46 for x in xs.iter_mut() {
47 *x *= 3;
48 }
49 println!("Updated vector: {:?}", xs);
50 }
Strings
The two most used string types in Rust are String and &str .
A String is stored as a vector of bytes ( Vec<u8> ), but guaranteed to always be a valid UTF-8 sequence.
String is heap allocated, growable and not null terminated.
&str is a slice ( &[u8] ) that always points to a valid UTF-8 sequence, and can be used to view into a
String , just like &[T] is a view into Vec<T> .
1 fn main() {
2 // (all the type annotations are superfluous)
3 // A reference to a string allocated in read only memory
4 let pangram: &'static str = "the quick brown fox jumps over the lazy dog";
5 println!("Pangram: {}", pangram);
6
7 // Iterate over words in reverse, no new string is allocated
8 println!("Words in reverse");
9 for word in pangram.split_whitespace().rev() {
10 println!("> {}", word);
11 }
12
13 // Copy chars into a vector, sort and remove duplicates
14 let mut chars: Vec<char> = pangram.chars().collect();
15 chars.sort();
16 chars.dedup();
17
18 // Create an empty and growable `String`
19 let mut string = String::new();
20 for c in chars {
21 // Insert a char at the end of string
22 string.push(c);
23 // Insert a string at the end of string
24 string.push_str(", ");
25 }
26
27 // The trimmed string is a slice to the original string, hence no new
28 // allocation is performed
29 let chars_to_trim: &[char] = &[' ', ','];
30 let trimmed_str: &str = string.trim_matches(chars_to_trim);
31 println!("Used characters: {}", trimmed_str);
32
33 // Heap allocate a string
34 let alice = String::from("I like dogs");
35 // Allocate new memory and store the modified string there
36 let bob: String = alice.replace("dog", "cat");
37
38 println!("Alice says: {}", alice);
39 println!("Bob says: {}", bob);
40 }
More str / String methods can be found under the std::str and std::string modules
Literals and escapes 402
There are multiple ways to write string literals with special characters in them. All result in a similar &str
so it's best to use the form that is the most convenient to write. Similarly there are multiple ways to write
byte string literals, which all result in &[u8; N] .
Generally special characters are escaped with a backslash character: \ . This way you can add any character
to your string, even unprintable ones and ones that you don't know how to type. If you want a literal
backslash, escape it with another one: \\
String or character literal delimiters occurring within a literal must be escaped: "\"" , '\'' .
1 fn main() {
2 // You can use escapes to write bytes by their hexadecimal values...
3 let byte_escape = "I'm writing \x52\x75\x73\x74!";
4 println!("What are you doing\x3F (\\x3F means ?) {}", byte_escape);
5
6 // ...or Unicode code points.
7 let unicode_codepoint = "\u{211D}";
8 let character_name = "\"DOUBLE-STRUCK CAPITAL R\"";
9
10 println!("Unicode character {} (U+211D) is called {}",
11 unicode_codepoint, character_name );
12
13
14 let long_string = "String literals
15 can span multiple lines.
16 The linebreak and indentation here ->\
17 <- can be escaped too!";
18 println!("{}", long_string);
19 }
Sometimes there are just too many characters that need to be escaped or it's just much more convenient to
write a string out as-is. This is where raw string literals come into play.
1 fn main() {
2 let raw_str = r"Escapes don't work here: \x3F \u{211D}";
3 println!("{}", raw_str);
4
5 // If you need quotes in a raw string, add a pair of #s
6 let quotes = r#"And then I said: "There is no escape!""#;
7 println!("{}", quotes);
8
9 // If you need "# in your string, just use more #s in the delimiter.
10 // You can use up to 255 #s.
11 let longer_delimiter = r###"A string with "# in it. And even "##!"###;
12 println!("{}", longer_delimiter);
13 }
Want a string that's not UTF-8? (Remember, str and String must be valid UTF-8). Or maybe you want
an array of bytes that's mostly text? Byte strings to the rescue!
1 use std::str; 403
2
3 fn main() {
4 // Note that this is not actually a `&str`
5 let bytestring: &[u8; 21] = b"this is a byte string";
6
7 // Byte arrays don't have the `Display` trait, so printing them is a bit limited
8 println!("A byte string: {:?}", bytestring);
9
10 // Byte strings can have byte escapes...
11 let escaped = b"\x52\x75\x73\x74 as bytes";
12 // ...but no unicode escapes
13 // let escaped = b"\u{211D} is not allowed";
14 println!("Some escaped bytes: {:?}", escaped);
15
16
17 // Raw byte strings work just like raw strings
18 let raw_bytestring = br"\u{211D} is not escaped here";
19 println!("{:?}", raw_bytestring);
20
21 // Converting a byte array to `str` can fail
22 if let Ok(my_str) = str::from_utf8(raw_bytestring) {
23 println!("And the same as text: '{}'", my_str);
24 }
25
26 let _quotes = br#"You can also use "fancier" formatting, \
27 like with normal raw strings"#;
28
29 // Byte strings don't have to be UTF-8
30 let shift_jis = b"\x82\xe6\x82\xa8\x82\xb1\x82\xbb"; // "よ う こ そ" in SHIFT-JIS
31
32 // But then they can't always be converted to `str`
33 match str::from_utf8(shift_jis) {
34 Ok(my_str) => println!("Conversion successful: '{}'", my_str),
35 Err(e) => println!("Conversion failed: {:?}", e),
36 };
37 }
For conversions between character encodings check out the encoding crate.
A more detailed listing of the ways to write string literals and escape characters is given in the 'Tokens'
chapter of the Rust Reference.
404
405
Option
Sometimes it's desirable to catch the failure of some parts of a program instead of calling panic! ; this can
be accomplished using the Option enum.
Result
We've seen that the Option enum can be used as a return value from functions that may fail, where None
can be returned to indicate failure. However, sometimes it is important to express why an operation failed.
To do this we have the Result enum.
Ok(value) which indicates that the operation succeeded, and wraps the value returned by the
operation. ( value has type T )
Err(why) , which indicates that the operation failed, and wraps why , which (hopefully) explains the
cause of the failure. ( why has type E )
1 mod checked { 408
2 // Mathematical "errors" we want to catch
3 #[derive(Debug)]
4 pub enum MathError {
5 DivisionByZero,
6 NonPositiveLogarithm,
7 NegativeSquareRoot,
8 }
9
10 pub type MathResult = Result<f64, MathError>;
11
12 pub fn div(x: f64, y: f64) -> MathResult {
13 if y == 0.0 {
14 // This operation would `fail`, instead let's return the reason of
15 // the failure wrapped in `Err`
16 Err(MathError::DivisionByZero)
17 } else {
18 // This operation is valid, return the result wrapped in `Ok`
19 Ok(x / y)
20 }
21 }
22
23 pub fn sqrt(x: f64) -> MathResult {
24 if x < 0.0 {
25 Err(MathError::NegativeSquareRoot)
26 } else {
27 Ok(x.sqrt())
28 }
29 }
30
31 pub fn ln(x: f64) -> MathResult {
32 if x <= 0.0 {
33 Err(MathError::NonPositiveLogarithm)
34 } else {
35 Ok(x.ln())
36 }
37 }
38 }
39
40 // `op(x, y)` === `sqrt(ln(x / y))`
41 fn op(x: f64, y: f64) -> f64 {
42 // This is a three level match pyramid!
43 match checked::div(x, y) {
44 Err(why) => panic!("{:?}", why),
45 Ok(ratio) => match checked::ln(ratio) {
46 Err(why) => panic!("{:?}", why),
47 Ok(ln) => match checked::sqrt(ln) {
48 Err(why) => panic!("{:?}", why),
49 Ok(sqrt) => sqrt,
50 },
51 },
52 }
53 }
54
55 fn main() {
56 // Will this fail?
57 println!("{}", op(1.0, 10.0));
58 }
409
410
?
Chaining results using match can get pretty untidy; luckily, the ? operator can be used to make things
pretty again. ? is used at the end of an expression returning a Result , and is equivalent to a match
expression, where the Err(err) branch expands to an early return Err(From::from(err)) , and the
Ok(ok) branch expands to an ok expression.
1 mod checked { 411
2 #[derive(Debug)]
3 enum MathError {
4 DivisionByZero,
5 NonPositiveLogarithm,
6 NegativeSquareRoot,
7 }
8
9 type MathResult = Result<f64, MathError>;
10
11 fn div(x: f64, y: f64) -> MathResult {
12 if y == 0.0 {
13 Err(MathError::DivisionByZero)
14 } else {
15 Ok(x / y)
16 }
17 }
18
19 fn sqrt(x: f64) -> MathResult {
20 if x < 0.0 {
21 Err(MathError::NegativeSquareRoot)
22 } else {
23 Ok(x.sqrt())
24 }
25 }
26
27 fn ln(x: f64) -> MathResult {
28 if x <= 0.0 {
29 Err(MathError::NonPositiveLogarithm)
30 } else {
31 Ok(x.ln())
32 }
33 }
34
35 // Intermediate function
36 fn op_(x: f64, y: f64) -> MathResult {
37 // if `div` "fails", then `DivisionByZero` will be `return`ed
38 let ratio = div(x, y)?;
39
40 // if `ln` "fails", then `NonPositiveLogarithm` will be `return`ed
41 let ln = ln(ratio)?;
42
43 sqrt(ln)
44 }
45
46 pub fn op(x: f64, y: f64) {
47 match op_(x, y) {
48 Err(why) => panic!("{}", match why {
49 MathError::NonPositiveLogarithm
50 => "logarithm of non-positive number",
51 MathError::DivisionByZero
52 => "division by zero",
53 MathError::NegativeSquareRoot
54 => "square root of negative number",
55 }),
56 Ok(value) => println!("{}", value),
57 }
58 }
59 }
60
61 fn main() {
62 checked::op(1.0, 10.0);
63 }
412
Be sure to check the documentation, as there are many methods to map/compose Result .
413
414
panic!
The panic! macro can be used to generate a panic and start unwinding its stack. While unwinding, the
runtime will take care of freeing all the resources owned by the thread by calling the destructor of all its
objects.
Since we are dealing with programs with only one thread, panic! will cause the program to report the
panic message and exit.
HashMap
Where vectors store values by an integer index, HashMap s store values by key. HashMap keys can be
booleans, integers, strings, or any other type that implements the Eq and Hash traits. More on this in the
next section.
Like vectors, HashMap s are growable, but HashMaps can also shrink themselves when they have excess
space. You can create a HashMap with a certain starting capacity using HashMap::with_capacity(uint) , or
use HashMap::new() to get a HashMap with a default initial capacity (recommended).
1 use std::collections::HashMap;
2
3 fn call(number: &str) -> &str {
4 match number {
5 "798-1364" => "We're sorry, the call cannot be completed as dialed.
6 Please hang up and try again.",
7 "645-7689" => "Hello, this is Mr. Awesome's Pizza. My name is Fred.
8 What can I get for you today?",
9 _ => "Hi! Who is this again?"
10 }
11 }
12
13 fn main() {
14 let mut contacts = HashMap::new();
15
16 contacts.insert("Daniel", "798-1364");
17 contacts.insert("Ashley", "645-7689");
18 contacts.insert("Katie", "435-8291");
19 contacts.insert("Robert", "956-1745");
20
21 // Takes a reference and returns Option<&V>
22 match contacts.get(&"Daniel") {
23 Some(&number) => println!("Calling Daniel: {}", call(number)),
24 _ => println!("Don't have Daniel's number."),
25 }
26
27 // `HashMap::insert()` returns `None`
28 // if the inserted value is new, `Some(value)` otherwise
29 contacts.insert("Daniel", "164-6743");
30
31 match contacts.get(&"Ashley") {
32 Some(&number) => println!("Calling Ashley: {}", call(number)),
33 _ => println!("Don't have Ashley's number."),
34 }
35
36 contacts.remove(&"Ashley");
37
38 // `HashMap::iter()` returns an iterator that yields
39 // (&'a key, &'a value) pairs in arbitrary order.
40 for (contact, &number) in contacts.iter() {
41 println!("Calling {}: {}", contact, call(number));
42 }
43 }
For more information on how hashing and hash maps (sometimes called hash tables) work, have a look at
Hash Table Wikipedia
417
418
bool (though not very useful since there are only two possible keys)
int , uint , and all variations thereof
String and &str (protip: you can have a HashMap keyed by String and call .get() with an
&str )
Note that f32 and f64 do not implement Hash , likely because floating-point precision errors would
make using them as hashmap keys horribly error-prone.
All collection classes implement Eq and Hash if their contained type also respectively implements Eq
and Hash . For example, Vec<T> will implement Hash if T implements Hash .
You can easily implement Eq and Hash for a custom type with just one line: #[derive(PartialEq, Eq,
Hash)]
The compiler will do the rest. If you want more control over the details, you can implement Eq and/or
Hash yourself. This guide will not cover the specifics of implementing Hash .
To play around with using a struct in HashMap , let's try making a very simple user logon system:
1 use std::collections::HashMap; 419
2
3 // Eq requires that you derive PartialEq on the type.
4 #[derive(PartialEq, Eq, Hash)]
5 struct Account<'a>{
6 username: &'a str,
7 password: &'a str,
8 }
9
10 struct AccountInfo<'a>{
11 name: &'a str,
12 email: &'a str,
13 }
14
15 type Accounts<'a> = HashMap<Account<'a>, AccountInfo<'a>>;
16
17 fn try_logon<'a>(accounts: &Accounts<'a>,
18 username: &'a str, password: &'a str){
19 println!("Username: {}", username);
20 println!("Password: {}", password);
21 println!("Attempting logon...");
22
23 let logon = Account {
24 username,
25 password,
26 };
27
28 match accounts.get(&logon) {
29 Some(account_info) => {
30 println!("Successful logon!");
31 println!("Name: {}", account_info.name);
32 println!("Email: {}", account_info.email);
33 },
34 _ => println!("Login failed!"),
35 }
36 }
37
38 fn main(){
39 let mut accounts: Accounts = HashMap::new();
40
41 let account = Account {
42 username: "j.everyman",
43 password: "password123",
44 };
45
46 let account_info = AccountInfo {
47 name: "John Everyman",
48 email: "[email protected]",
49 };
50
51 accounts.insert(account, account_info);
52
53 try_logon(&accounts, "j.everyman", "psasword123");
54
55 try_logon(&accounts, "j.everyman", "password123");
56 }
420
421
HashSet
Consider a HashSet as a HashMap where we just care about the keys ( HashSet<T> is, in actuality, just a
wrapper around HashMap<T, ()> ).
"What's the point of that?" you ask. "I could just store the keys in a Vec ."
A HashSet 's unique feature is that it is guaranteed to not have duplicate elements. That's the contract that
any set collection fulfills. HashSet is just one implementation. (see also: BTreeSet )
If you insert a value that is already present in the HashSet , (i.e. the new value is equal to the existing and
they both have the same hash), then the new value will replace the old.
This is great for when you never want more than one of something, or when you want to know if you've
already got something.
Sets have 4 primary operations (all of the following calls return an iterator):
difference : get all the elements that are in the first set but not the second.
intersection : get all the elements that are only in both sets.
symmetric_difference : get all the elements that are in one set or the other, but not both.
Rc
When multiple ownership is needed, Rc (Reference Counting) can be used. Rc keeps track of the number
of the references which means the number of owners of the value wrapped inside an Rc .
Cloning an Rc never performs a deep copy. Cloning creates just another pointer to the wrapped value, and
increments the count.
1 use std::rc::Rc;
2
3 fn main() {
4 let rc_examples = "Rc examples".to_string();
5 {
6 println!("--- rc_a is created ---");
7
8 let rc_a: Rc<String> = Rc::new(rc_examples);
9 println!("Reference Count of rc_a: {}", Rc::strong_count(&rc_a));
10
11 {
12 println!("--- rc_a is cloned to rc_b ---");
13
14 let rc_b: Rc<String> = Rc::clone(&rc_a);
15 println!("Reference Count of rc_b: {}", Rc::strong_count(&rc_b));
16 println!("Reference Count of rc_a: {}", Rc::strong_count(&rc_a));
17
18 // Two `Rc`s are equal if their inner values are equal
19 println!("rc_a and rc_b are equal: {}", rc_a.eq(&rc_b));
20
21 // We can use methods of a value directly
22 println!("Length of the value inside rc_a: {}", rc_a.len());
23 println!("Value of rc_b: {}", rc_b);
24
25 println!("--- rc_b is dropped out of scope ---");
26 }
27
28 println!("Reference Count of rc_a: {}", Rc::strong_count(&rc_a));
29
30 println!("--- rc_a is dropped out of scope ---");
31 }
32
33 // Error! `rc_examples` already moved into `rc_a`
34 // And when `rc_a` is dropped, `rc_examples` is dropped together
35 // println!("rc_examples: {}", rc_examples);
36 // TODO ^ Try uncommenting this line
37 }
See also:
Arc
When shared ownership between threads is needed, Arc (Atomically Reference Counted) can be used.
This struct, via the Clone implementation can create a reference pointer for the location of a value in the
memory heap while increasing the reference counter. As it shares ownership between threads, when the
last reference pointer to a value is out of scope, the variable is dropped.
1 use std::time::Duration;
2 use std::sync::Arc;
3 use std::thread;
4
5 fn main() {
6 // This variable declaration is where its value is specified.
7 let apple = Arc::new("the same apple");
8
9 for _ in 0..10 {
10 // Here there is no value specification as it is a pointer to a
11 // reference in the memory heap.
12 let apple = Arc::clone(&apple);
13
14 thread::spawn(move || {
15 // As Arc was used, threads can be spawned using the value allocated
16 // in the Arc variable pointer's location.
17 println!("{:?}", apple);
18 });
19 }
20
21 // Make sure all Arc instances are printed from spawned threads.
22 thread::sleep(Duration::from_secs(1));
23 }
427
428
Std misc
Many other types are provided by the std library to support things such as:
Threads
Channels
File I/O
See also:
Threads
Rust provides a mechanism for spawning native OS threads via the spawn function, the argument of this
function is a moving closure.
1 use std::thread;
2
3 const NTHREADS: u32 = 10;
4
5 // This is the `main` thread
6 fn main() {
7 // Make a vector to hold the children which are spawned.
8 let mut children = vec![];
9
10 for i in 0..NTHREADS {
11 // Spin up another thread
12 children.push(thread::spawn(move || {
13 println!("this is thread number {}", i);
14 }));
15 }
16
17 for child in children {
18 // Wait for the thread to finish. Returns a result.
19 let _ = child.join();
20 }
21 }
Testcase: map-reduce
Rust makes it very easy to parallelise data processing, without many of the headaches traditionally
associated with such an attempt.
The standard library provides great threading primitives out of the box. These, combined with Rust's
concept of Ownership and aliasing rules, automatically prevent data races.
The aliasing rules (one writable reference XOR many readable references) automatically prevent you from
manipulating state that is visible to other threads. (Where synchronisation is needed, there are
synchronisation primitives like Mutex es or Channel s.)
In this example, we will calculate the sum of all digits in a block of numbers. We will do this by parcelling
out chunks of the block into different threads. Each thread will sum its tiny block of digits, and
subsequently we will sum the intermediate sums produced by each thread.
Note that, although we're passing references across thread boundaries, Rust understands that we're only
passing read-only references, and that thus no unsafety or data races can occur. Also because the
references we're passing have 'static lifetimes, Rust understands that our data won't be destroyed while
these threads are still running. (When you need to share non- static data between threads, you can use a
smart pointer like Arc to keep the data alive and avoid non- static lifetimes.)
1 use std::thread; 433
2
3 // This is the `main` thread
4 fn main() {
5
6 // This is our data to process.
7 // We will calculate the sum of all digits via a threaded map-reduce algorithm.
8 // Each whitespace separated chunk will be handled in a different thread.
9 //
10 // TODO: see what happens to the output if you insert spaces!
11 let data = "86967897737416471853297327050364959
12 11861322575564723963297542624962850
13 70856234701860851907960690014725639
14 38397966707106094172783238747669219
15 52380795257888236525459303330302837
16 58495327135744041048897885734297812
17 69920216438980873548808413720956532
18 16278424637452589860345374828574668";
19
20 // Make a vector to hold the child-threads which we will spawn.
21 let mut children = vec![];
22
23 /*************************************************************************
24 * "Map" phase
25 *
26 * Divide our data into segments, and apply initial processing
27 ************************************************************************/
28
29 // split our data into segments for individual calculation
30 // each chunk will be a reference (&str) into the actual data
31 let chunked_data = data.split_whitespace();
32
33 // Iterate over the data segments.
34 // .enumerate() adds the current loop index to whatever is iterated
35 // the resulting tuple "(index, element)" is then immediately
36 // "destructured" into two variables, "i" and "data_segment" with a
37 // "destructuring assignment"
38 for (i, data_segment) in chunked_data.enumerate() {
39 println!("data segment {} is \"{}\"", i, data_segment);
40
41 // Process each data segment in a separate thread
42 //
43 // spawn() returns a handle to the new thread,
44 // which we MUST keep to access the returned value
45 //
46 // 'move || -> u32' is syntax for a closure that:
47 // * takes no arguments ('||')
48 // * takes ownership of its captured variables ('move') and
49 // * returns an unsigned 32-bit integer ('-> u32')
50 //
51 // Rust is smart enough to infer the '-> u32' from
52 // the closure itself so we could have left that out.
53 //
54 // TODO: try removing the 'move' and see what happens
55 children.push(thread::spawn(move || -> u32 {
56 // Calculate the intermediate sum of this segment:
57 let result = data_segment
58 // iterate over the characters of our segment..
59 .chars()
60 // .. convert text-characters to their number value..
61 .map(|c| c.to_digit(10).expect("should be a digit"))
62 // .. and sum the resulting iterator of numbers
63 .sum();
64 434
65 // println! locks stdout, so no text-interleaving occurs
66 println!("processed segment {}, result={}", i, result);
67
68 // "return" not needed, because Rust is an "expression language", the
69 // last evaluated expression in each block is automatically its value.
70 result
71
72 }));
73 }
74
75
76 /*************************************************************************
77 * "Reduce" phase
78 *
79 * Collect our intermediate results, and combine them into a final result
80 ************************************************************************/
81
82 // combine each thread's intermediate results into a single final sum.
83 //
84 // we use the "turbofish" ::<> to provide sum() with a type hint.
85 //
86 // TODO: try without the turbofish, by instead explicitly
87 // specifying the type of final_result
88 let final_result = children.into_iter().map(|c| c.join().unwrap()).sum::<u32>();
89
90 println!("Final sum result: {}", final_result);
91 }
92
93
Assignments
It is not wise to let our number of threads depend on user inputted data. What if the user decides to insert a
lot of spaces? Do we really want to spawn 2,000 threads? Modify the program so that the data is always
chunked into a limited number of chunks, defined by a static constant at the beginning of the program.
See also:
Threads
vectors and iterators
closures, move semantics and move closures
destructuring assignments
turbofish notation to help type inference
unwrap vs. expect
enumerate
435
436
Channels
Rust provides asynchronous channels for communication between threads. Channels allow a
unidirectional flow of information between two end-points: the Sender and the Receiver .
Path
The Path struct represents file paths in the underlying filesystem. There are two flavors of Path :
posix::Path , for UNIX-like systems, and windows::Path , for Windows. The prelude exports the
appropriate platform-specific Path variant.
A Path can be created from an OsStr , and provides several methods to get information from the
file/directory the path points to.
A Path is immutable. The owned version of Path is PathBuf . The relation between Path and PathBuf
is similar to that of str and String : a PathBuf can be mutated in-place, and can be dereferenced to a
Path .
Note that a Path is not internally represented as an UTF-8 string, but instead is stored as an OsString .
Therefore, converting a Path to a &str is not free and may fail (an Option is returned). However, a Path
can be freely converted to an OsString or &OsStr using into_os_string and as_os_str , respectively.
1 use std::path::Path;
2
3 fn main() {
4 // Create a `Path` from an `&'static str`
5 let path = Path::new(".");
6
7 // The `display` method returns a `Display`able structure
8 let _display = path.display();
9
10 // `join` merges a path with a byte container using the OS specific
11 // separator, and returns a `PathBuf`
12 let mut new_path = path.join("a").join("b");
13
14 // `push` extends the `PathBuf` with a `&Path`
15 new_path.push("c");
16 new_path.push("myfile.tar.gz");
17
18 // `set_file_name` updates the file name of the `PathBuf`
19 new_path.set_file_name("package.tgz");
20
21 // Convert the `PathBuf` into a string slice
22 match new_path.to_str() {
23 None => panic!("new path is not a valid UTF-8 sequence"),
24 Some(s) => println!("new path is {}", s),
25 }
26 }
27
Be sure to check at other Path methods ( posix::Path or windows::Path ) and the Metadata struct.
See also:
File I/O
The File struct represents a file that has been opened (it wraps a file descriptor), and gives read and/or
write access to the underlying file.
Since many things can go wrong when doing file I/O, all the File methods return the io::Result<T>
type, which is an alias for Result<T, io::Error> .
This makes the failure of all I/O operations explicit. Thanks to this, the programmer can see all the failure
paths, and is encouraged to handle them in a proactive manner.
441
442
open
The open function can be used to open a file in read-only mode.
A File owns a resource, the file descriptor and takes care of closing the file when it is drop ed.
1 use std::fs::File;
2 use std::io::prelude::*;
3 use std::path::Path;
4
5 fn main() {
6 // Create a path to the desired file
7 let path = Path::new("hello.txt");
8 let display = path.display();
9
10 // Open the path in read-only mode, returns `io::Result<File>`
11 let mut file = match File::open(&path) {
12 Err(why) => panic!("couldn't open {}: {}", display, why),
13 Ok(file) => file,
14 };
15
16 // Read the file contents into a string, returns `io::Result<usize>`
17 let mut s = String::new();
18 match file.read_to_string(&mut s) {
19 Err(why) => panic!("couldn't read {}: {}", display, why),
20 Ok(_) => print!("{} contains:\n{}", display, s),
21 }
22
23 // `file` goes out of scope, and the "hello.txt" file gets closed
24 }
(You are encouraged to test the previous example under different failure conditions: hello.txt doesn't
exist, or hello.txt is not readable, etc.)
443
444
create
The create function opens a file in write-only mode. If the file already existed, the old content is
destroyed. Otherwise, a new file is created.
use std::fs::File;
use std::io::prelude::*;
use std::path::Path;
fn main() {
let path = Path::new("lorem_ipsum.txt");
let display = path.display();
(As in the previous example, you are encouraged to test this example under failure conditions.)
read_lines
A naive approach
This might be a reasonable first attempt for a beginner's first implementation for reading lines from a file.
use std::fs::read_to_string;
result
}
Since the method lines() returns an iterator over the lines in the file, we can also perform a map inline
and collect the results, yielding a more concise and fluent expression.
use std::fs::read_to_string;
Note that in both examples above, we must convert the &str reference returned from lines() to the
owned type String , using .to_string() and String::from respectively.
We also update read_lines to return an iterator instead of allocating new String objects in memory for
each line.
use std::fs::File; 447
use std::io::{self, BufRead};
use std::path::Path;
fn main() {
// File hosts.txt must exist in the current path
if let Ok(lines) = read_lines("./hosts.txt") {
// Consumes the iterator, returns an (Optional) String
for line in lines.flatten() {
println!("{}", line);
}
}
}
(Note that since File::open expects a generic AsRef<Path> as argument, we define our generic
read_lines() method with the same generic constraint, using the where keyword.)
This process is more efficient than creating a String in memory with all of the file's contents. This can
especially cause performance issues when working with larger files.
448
449
Child processes
The process::Output struct represents the output of a finished child process, and the process::Command
struct is a process builder.
1 use std::process::Command;
2
3 fn main() {
4 let output = Command::new("rustc")
5 .arg("--version")
6 .output().unwrap_or_else(|e| {
7 panic!("failed to execute process: {}", e)
8 });
9
10 if output.status.success() {
11 let s = String::from_utf8_lossy(&output.stdout);
12
13 print!("rustc succeeded and stdout was:\n{}", s);
14 } else {
15 let s = String::from_utf8_lossy(&output.stderr);
16
17 print!("rustc failed and stderr was:\n{}", s);
18 }
19 }
(You are encouraged to try the previous example with an incorrect flag passed to rustc )
450
451
Pipes
The std::Child struct represents a running child process, and exposes the stdin , stdout and stderr
handles for interaction with the underlying process via pipes.
use std::io::prelude::*;
use std::process::{Command, Stdio};
fn main() {
// Spawn the `wc` command
let mut cmd = if cfg!(target_family = "windows") {
let mut cmd = Command::new("powershell");
cmd.arg("-Command").arg("$input | Measure-Object -Line -Word -Character");
cmd
} else {
Command::new("wc")
};
let process = match cmd
.stdin(Stdio::piped())
.stdout(Stdio::piped())
.spawn() {
Err(why) => panic!("couldn't spawn wc: {}", why),
Ok(process) => process,
};
// Because `stdin` does not live after the above calls, it is `drop`ed,
// and the pipe is closed.
//
// This is very important, otherwise `wc` wouldn't start processing the
// input we just sent.
Wait
If you'd like to wait for a process::Child to finish, you must call Child::wait , which will return a
process::ExitStatus .
use std::process::Command;
fn main() {
let mut child = Command::new("sleep").arg("5").spawn().unwrap();
let _result = child.wait().unwrap();
Filesystem Operations
The std::fs module contains several functions that deal with the filesystem.
use std::fs; 456
use std::fs::{File, OpenOptions};
use std::io;
use std::io::prelude::*;
#[cfg(target_family = "unix")]
use std::os::unix;
#[cfg(target_family = "windows")]
use std::os::windows;
use std::path::Path;
f.write_all(s.as_bytes())
}
fn main() {
println!("`mkdir a`");
// Create a directory, returns `io::Result<()>`
match fs::create_dir("a") {
Err(why) => println!("! {:?}", why.kind()),
Ok(_) => {},
}
println!("`mkdir -p a/c/d`");
// Recursively create a directory, returns `io::Result<()>`
fs::create_dir_all("a/c/d").unwrap_or_else(|why| {
println!("! {:?}", why.kind());
});
println!("`touch a/c/e.txt`");
touch(&Path::new("a/c/e.txt")).unwrap_or_else(|why| {
println!("! {:?}", why.kind());
});
println!("`cat a/c/b.txt`");
match cat(&Path::new("a/c/b.txt")) {
Err(why) => println!("! {:?}", why.kind()),
Ok(s) => println!("> {}", s),
}
println!("`ls a`");
// Read the contents of a directory, returns `io::Result<Vec<Path>>`
match fs::read_dir("a") {
Err(why) => println!("! {:?}", why.kind()),
Ok(paths) => for path in paths {
println!("> {:?}", path.unwrap().path());
},
}
println!("`rm a/c/e.txt`");
// Remove a file, returns `io::Result<()>`
fs::remove_file("a/c/e.txt").unwrap_or_else(|why| {
println!("! {:?}", why.kind());
});
println!("`rmdir a/c/d`");
// Remove an empty directory, returns `io::Result<()>`
fs::remove_dir("a/c/d").unwrap_or_else(|why| {
println!("! {:?}", why.kind());
});
}
1 directory, 2 files
See also:
cfg!
459
460
Program arguments
Standard Library
The command line arguments can be accessed using std::env::args , which returns an iterator that yields
a String for each argument:
1 use std::env;
2
3 fn main() {
4 let args: Vec<String> = env::args().collect();
5
6 // The first argument is the path that was used to call the program.
7 println!("My path is {}.", args[0]);
8
9 // The rest of the arguments are the passed command line parameters.
10 // Call the program like this:
11 // $ ./args arg1 arg2
12 println!("I got {:?} arguments: {:?}.", args.len() - 1, &args[1..]);
13 }
$ ./args 1 2 3
My path is ./args.
I got 3 arguments: ["1", "2", "3"].
Crates
Alternatively, there are numerous crates that can provide extra functionality when creating command-line
applications. One of the more popular command line argument crates being clap .
461
462
Argument parsing
Matching can be used to parse simple arguments:
1 use std::env; 463
2
3 fn increase(number: i32) {
4 println!("{}", number + 1);
5 }
6
7 fn decrease(number: i32) {
8 println!("{}", number - 1);
9 }
10
11 fn help() {
12 println!("usage:
13 match_args <string>
14 Check whether given string is the answer.
15 match_args {{increase|decrease}} <integer>
16 Increase or decrease given integer by one.");
17 }
18
19 fn main() {
20 let args: Vec<String> = env::args().collect();
21
22 match args.len() {
23 // no arguments passed
24 1 => {
25 println!("My name is 'match_args'. Try passing some arguments!");
26 },
27 // one argument passed
28 2 => {
29 match args[1].parse() {
30 Ok(42) => println!("This is the answer!"),
31 _ => println!("This is not the answer."),
32 }
33 },
34 // one command and one argument passed
35 3 => {
36 let cmd = &args[1];
37 let num = &args[2];
38 // parse the number
39 let number: i32 = match num.parse() {
40 Ok(n) => {
41 n
42 },
43 Err(_) => {
44 eprintln!("error: second argument not an integer");
45 help();
46 return;
47 },
48 };
49 // parse the command
50 match &cmd[..] {
51 "increase" => increase(number),
52 "decrease" => decrease(number),
53 _ => {
54 eprintln!("error: invalid command");
55 help();
56 },
57 }
58 },
59 // all the other cases
60 _ => {
61 // show a help message
62 help();
63 }
64 } 464
65 }
If you named your program match_args.rs and compile it like this rustc match_args.rs , you can
execute it as follows:
$ ./match_args Rust
This is not the answer.
$ ./match_args 42
This is the answer!
$ ./match_args do something
error: second argument not an integer
usage:
match_args <string>
Check whether given string is the answer.
match_args {increase|decrease} <integer>
Increase or decrease given integer by one.
$ ./match_args do 42
error: invalid command
usage:
match_args <string>
Check whether given string is the answer.
match_args {increase|decrease} <integer>
Increase or decrease given integer by one.
$ ./match_args increase 42
43
465
466
fn main() {
// z = -1 + 0i
let z = Complex { re: -1., im: 0. };
Testing
Rust is a programming language that cares a lot about correctness and it includes support for writing
software tests within the language itself.
Unit testing.
Doc testing.
Integration testing.
Also Rust has support for specifying additional dependencies for tests:
Dev-dependencies
See Also
The Book chapter on testing
API Guidelines on doc-testing
470
471
Unit testing
Tests are Rust functions that verify that the non-test code is functioning in the expected manner. The
bodies of test functions typically perform some setup, run the code we want to test, then assert whether the
results are what we expect.
Most unit tests go into a tests mod with the #[cfg(test)] attribute. Test functions are marked with the
#[test] attribute.
Tests fail when something in the test function panics. There are some helper macros:
#[cfg(test)]
mod tests {
// Note this useful idiom: importing names from outer (for mod tests) scope.
use super::*;
#[test]
fn test_add() {
assert_eq!(add(1, 2), 3);
}
#[test]
fn test_bad_add() {
// This assert would fire and test will fail.
// Please note, that private functions can be tested too!
assert_eq!(bad_add(1, 2), 3);
}
}
running 2 tests
test tests::test_bad_add ... FAILED
test tests::test_add ... ok
failures:
failures:
tests::test_bad_add
Tests and ?
None of the previous unit test examples had a return type. But in Rust 2018, your unit tests can return
Result<()> , which lets you use ? in them! This can make them much more concise.
Testing panics
To check functions that should panic under certain circumstances, use attribute #[should_panic] . This
attribute accepts optional parameter expected = with the text of the panic message. If your function can
panic in multiple ways, it helps make sure your test is testing the correct panic.
pub fn divide_non_zero_result(a: u32, b: u32) -> u32 { 473
if b == 0 {
panic!("Divide-by-zero error");
} else if a < b {
panic!("Divide result is zero");
}
a / b
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
fn test_divide() {
assert_eq!(divide_non_zero_result(10, 2), 5);
}
#[test]
#[should_panic]
fn test_any_panic() {
divide_non_zero_result(1, 0);
}
#[test]
#[should_panic(expected = "Divide result is zero")]
fn test_specific_panic() {
divide_non_zero_result(1, 10);
}
}
$ cargo test
running 3 tests
test tests::test_any_panic ... ok
test tests::test_divide ... ok
test tests::test_specific_panic ... ok
Doc-tests tmp-test-should-panic
running 0 tests
Doc-tests tmp-test-should-panic
running 0 tests
To run multiple tests one may specify part of a test name that matches all the tests that should be run.
Doc-tests tmp-test-should-panic
running 0 tests
Ignoring tests
Tests can be marked with the #[ignore] attribute to exclude some tests. Or to run them with command
cargo test -- --ignored
pub fn add(a: i32, b: i32) -> i32 { 475
a + b
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
fn test_add() {
assert_eq!(add(2, 2), 4);
}
#[test]
fn test_add_hundred() {
assert_eq!(add(100, 2), 102);
assert_eq!(add(2, 100), 102);
}
#[test]
#[ignore]
fn ignored_test() {
assert_eq!(add(0, 0), 0);
}
}
$ cargo test
running 3 tests
test tests::ignored_test ... ignored
test tests::test_add ... ok
test tests::test_add_hundred ... ok
Doc-tests tmp-ignore
running 0 tests
Doc-tests tmp-ignore
running 0 tests
Documentation testing
The primary way of documenting a Rust project is through annotating the source code. Documentation
comments are written in CommonMark Markdown specification and support code blocks in them. Rust
takes care about correctness, so these code blocks are compiled and used as documentation tests.
/// Usually doc comments may include sections "Examples", "Panics" and "Failures".
///
/// The next function divides two numbers.
///
/// # Examples
///
/// ```
/// let result = doccomments::div(10, 2);
/// assert_eq!(result, 5);
/// ```
///
/// # Panics
///
/// The function panics if the second argument is zero.
///
/// ```rust,should_panic
/// // panics on division by zero
/// doccomments::div(10, 0);
/// ```
pub fn div(a: i32, b: i32) -> i32 {
if b == 0 {
panic!("Divide-by-zero error");
}
a / b
}
Code blocks in documentation are automatically tested when running the regular cargo test command:
$ cargo test 478
running 0 tests
Doc-tests doccomments
running 3 tests
test src/lib.rs - add (line 7) ... ok
test src/lib.rs - div (line 21) ... ok
test src/lib.rs - div (line 31) ... ok
See Also
RFC505 on documentation style
API Guidelines on documentation guidelines
479
480
Integration testing
Unit tests are testing one module in isolation at a time: they're small and can test private code. Integration
tests are external to your crate and use only its public interface in the same way any other code would.
Their purpose is to test that many parts of your library work correctly together.
File src/lib.rs :
#[test]
fn test_add() {
assert_eq!(adder::add(3, 2), 5);
}
$ cargo test
running 0 tests
Running target/debug/deps/integration_test-bcd60824f5fbfe19
running 1 test
test test_add ... ok
Doc-tests adder
running 0 tests
Each Rust source file in the tests directory is compiled as a separate crate. In order to share some code
between integration tests we can make a module with public functions, importing and using it within tests.
File tests/common/mod.rs :
pub fn setup() {
// some setup code, like creating required files/directories, starting
// servers, etc.
}
#[test]
fn test_add() {
// using common code.
common::setup();
assert_eq!(adder::add(3, 2), 5);
}
Creating the module as tests/common.rs also works, but is not recommended because the test runner
will treat the file as a test crate and try to run tests inside it.
482
483
Development dependencies
Sometimes there is a need to have dependencies for tests (or examples, or benchmarks) only. Such
dependencies are added to Cargo.toml in the [dev-dependencies] section. These dependencies are not
propagated to other packages which depend on this package.
One such example is pretty_assertions , which extends standard assert_eq! and assert_ne! macros,
to provide colorful diff.
File Cargo.toml :
File src/lib.rs :
#[cfg(test)]
mod tests {
use super::*;
use pretty_assertions::assert_eq; // crate for test-only use. Cannot be used in non-
test code.
#[test]
fn test_add() {
assert_eq!(add(2, 3), 5);
}
}
See Also
Cargo docs on specifying dependencies.
484
485
Unsafe Operations
As an introduction to this section, to borrow from the official docs, "one should try to minimize the amount
of unsafe code in a code base." With that in mind, let's get started! Unsafe annotations in Rust are used to
bypass protections put in place by the compiler; specifically, there are four primary things that unsafe is
used for:
Raw Pointers
Raw pointers * and references &T function similarly, but references are always safe because they are
guaranteed to point to valid data due to the borrow checker. Dereferencing a raw pointer can only be done
through an unsafe block.
1 fn main() {
2 let raw_p: *const u32 = &10;
3
4 unsafe {
5 assert!(*raw_p == 10);
6 }
7 }
Some functions can be declared as unsafe , meaning it is the programmer's responsibility to ensure
correctness instead of the compiler's. One example of this is std::slice::from_raw_parts which will
create a slice given a pointer to the first element and a length.
1 use std::slice;
2
3 fn main() {
4 let some_vector = vec![1, 2, 3, 4];
5
6 let pointer = some_vector.as_ptr();
7 let length = some_vector.len();
8
9 unsafe {
10 let my_slice: &[u32] = slice::from_raw_parts(pointer, length);
11
12 assert_eq!(some_vector.as_slice(), my_slice);
13 }
14 }
For slice::from_raw_parts , one of the assumptions which must be upheld is that the pointer passed in
points to valid memory and that the memory pointed to is of the correct type. If these invariants aren't
upheld then the program's behaviour is undefined and there is no knowing what will happen. 486
487
488
Inline assembly
Rust provides support for inline assembly via the asm! macro. It can be used to embed handwritten
assembly in the assembly output generated by the compiler. Generally this should not be necessary, but
might be where the required performance or timing cannot be otherwise achieved. Accessing low level
hardware primitives, e.g. in kernel code, may also demand this functionality.
Note: the examples here are given in x86/x86-64 assembly, but other architectures are also
supported.
Basic usage
Let us start with the simplest possible example:
use std::arch::asm;
unsafe {
asm!("nop");
}
This will insert a NOP (no operation) instruction into the assembly generated by the compiler. Note that all
asm! invocations have to be inside an unsafe block, as they could insert arbitrary instructions and break
various invariants. The instructions to be inserted are listed in the first argument of the asm! macro as a
string literal.
use std::arch::asm;
let x: u64;
unsafe {
asm!("mov {}, 5", out(reg) x);
}
assert_eq!(x, 5);
This will write the value 5 into the u64 variable x . You can see that the string literal we use to specify 489
instructions is actually a template string. It is governed by the same rules as Rust format strings. The
arguments that are inserted into the template however look a bit different than you may be familiar with.
First we need to specify if the variable is an input or an output of the inline assembly. In this case it is an
output. We declared this by writing out . We also need to specify in what kind of register the assembly
expects the variable. In this case we put it in an arbitrary general purpose register by specifying reg . The
compiler will choose an appropriate register to insert into the template and will read the variable from there
after the inline assembly finishes executing.
use std::arch::asm;
let i: u64 = 3;
let o: u64;
unsafe {
asm!(
"mov {0}, {1}",
"add {0}, 5",
out(reg) o,
in(reg) i,
);
}
assert_eq!(o, 8);
This will add 5 to the input in variable i and write the result to variable o . The particular way this
assembly does this is first copying the value from i to the output, and then adding 5 to it.
First, we can see that asm! allows multiple template string arguments; each one is treated as a separate
line of assembly code, as if they were all joined together with newlines between them. This makes it easy
to format assembly code.
Second, we can see that inputs are declared by writing in instead of out .
Third, we can see that we can specify an argument number, or name as in any format string. For inline
assembly templates this is particularly useful as arguments are often used more than once. For more
complex inline assembly using this facility is generally recommended, as it improves readability, and
allows reordering instructions without changing the argument order.
We can further refine the above example to avoid the mov instruction:
use std::arch::asm;
We can see that inout is used to specify an argument that is both input and output. This is different from
specifying an input and output separately in that it is guaranteed to assign both to the same register.
It is also possible to specify different variables for the input and output parts of an inout operand: 490
use std::arch::asm;
let x: u64 = 3;
let y: u64;
unsafe {
asm!("add {0}, 5", inout(reg) x => y);
}
assert_eq!(y, 8);
Here is an example where inlateout cannot be used in release mode or other optimized cases:
use std::arch::asm;
In unoptimized cases (e.g. Debug mode), replacing inout(reg) a with inlateout(reg) a in the above
example can continue to give the expected result. However, with release mode or other optimized cases,
using inlateout(reg) a can instead lead to the final value a = 16 , causing the assertion to fail.
This is because in optimized cases, the compiler is free to allocate the same register for inputs b and c
since it knows that they have the same value. Furthermore, when inlateout is used, a and c could be
allocated to the same register, in which case the first add instruction would overwrite the initial load from
variable c . This is in contrast to how using inout(reg) a ensures a separate register is allocated for a .
However, the following example can use inlateout since the output is only modified after all input
registers have been read:
use std::arch::asm; 491
As you can see, this assembly fragment will still work correctly if a and b are assigned to the same
register.
use std::arch::asm;
In this example we call the out instruction to output the content of the cmd variable to port 0x64 . Since
the out instruction only accepts eax (and its sub registers) as operand we had to use the eax constraint
specifier.
Note: unlike other operand types, explicit register operands cannot be used in the template string:
you can't use {} and should write the register name directly instead. Also, they must appear at the
end of the operand list after all other operand types.
unsafe {
asm!(
// The x86 mul instruction takes rax as an implicit input and writes
// the 128-bit result of the multiplication to rax:rdx.
"mul {}",
in(reg) a,
inlateout("rax") b => lo,
lateout("rdx") hi
);
}
This uses the mul instruction to multiply two 64-bit inputs with a 128-bit result. The only explicit
operand is a register, that we fill from the variable a . The second operand is implicit, and must be the rax
register, which we fill from the variable b . The lower 64 bits of the result are stored in rax from which we
fill the variable lo . The higher 64 bits are stored in rdx from which we fill the variable hi .
Clobbered registers
In many cases inline assembly will modify state that is not needed as an output. Usually this is either
because we have to use a scratch register in the assembly or because instructions modify state that we
don't need to further examine. This state is generally referred to as being "clobbered". We need to tell the
compiler about this since it may need to save and restore this state around the inline assembly block.
use std::arch::asm; 493
fn main() {
// three entries of four bytes each
let mut name_buf = [0_u8; 12];
// String is stored as ascii in ebx, edx, ecx in order
// Because ebx is reserved, the asm needs to preserve the value of it.
// So we push and pop it around the main asm.
// 64 bit mode on 64 bit processors does not allow pushing/popping of
// 32 bit registers (like ebx), so we have to use the extended rbx register instead.
unsafe {
asm!(
"push rbx",
"cpuid",
"mov [rdi], ebx",
"mov [rdi + 4], edx",
"mov [rdi + 8], ecx",
"pop rbx",
// We use a pointer to an array for storing the values to simplify
// the Rust code at the cost of a couple more asm instructions
// This is more explicit with how the asm works however, as opposed
// to explicit register outputs such as `out("ecx") val`
// The *pointer itself* is only an input even though it's written behind
in("rdi") name_buf.as_mut_ptr(),
// select cpuid 0, also specify eax as clobbered
inout("eax") 0 => _,
// cpuid clobbers these registers too
out("ecx") _,
out("edx") _,
);
}
In the example above we use the cpuid instruction to read the CPU manufacturer ID. This instruction
writes to eax with the maximum supported cpuid argument and ebx , edx , and ecx with the CPU
manufacturer ID as ASCII bytes in that order.
Even though eax is never read we still need to tell the compiler that the register has been modified so that
the compiler can save any values that were in these registers before the asm. This is done by declaring it as
an output but with _ instead of a variable name, which indicates that the output value is to be discarded.
This code also works around the limitation that ebx is a reserved register by LLVM. That means that LLVM
assumes that it has full control over the register and it must be restored to its original state before exiting
the asm block, so it cannot be used as an input or output except if the compiler uses it to fulfill a general
register class (e.g. in(reg) ). This makes reg operands dangerous when using reserved registers as we
could unknowingly corrupt our input or output because they share the same register.
To work around this we use rdi to store the pointer to the output array, save ebx via push , read from
ebx inside the asm block into the array and then restore ebx to its original state via pop . The push and
pop use the full 64-bit rbx version of the register to ensure that the entire register is saved. On 32 bit
targets the code would instead use ebx in the push / pop .
This can also be used with a general register class to obtain a scratch register for use inside the asm code:494
use std::arch::asm;
use std::arch::asm;
In some cases, fine control is needed over the way a register name is formatted when inserted into the
template string. This is needed when an architecture's assembly language has several names for the same
register, each typically being a "view" over a subset of the register (e.g. the low 32 bits of a 64-bit register).
By default the compiler will always choose the name that refers to the full register size (e.g. rax on x86-
64, eax on x86, etc).
This default can be overridden by using modifiers on the template string operands, just like you would with
format strings:
use std::arch::asm;
unsafe {
asm!("mov {0:h}, {0:l}", inout(reg_abcd) x);
}
assert_eq!(x, 0xabab);
In this example, we use the reg_abcd register class to restrict the register allocator to the 4 legacy x86
registers ( ax , bx , cx , dx ) of which the first two bytes can be addressed independently.
Let us assume that the register allocator has chosen to allocate x in the ax register. The h modifier will
emit the register name for the high byte of that register and the l modifier will emit the register name for
the low byte. The asm code will therefore be expanded as mov ah, al which copies the low byte of the
value into the high byte.
If you use a smaller data type (e.g. u16 ) with an operand and forget to use template modifiers, the
compiler will emit a warning and suggest the correct modifier to use.
use std::arch::asm;
fn load_fpu_control_word(control: u16) {
unsafe {
asm!("fldcw [{}]", in(reg) &control, options(nostack));
}
}
Labels 496
Any reuse of a named label, local or otherwise, can result in an assembler or linker error or may cause other
strange behavior. Reuse of a named label can happen in a variety of ways including:
explicitly: using a label more than once in one asm! block, or multiple times across blocks.
implicitly via inlining: the compiler is allowed to instantiate multiple copies of an asm! block, for
example when the function containing it is inlined in multiple places.
implicitly via LTO: LTO can cause code from other crates to be placed in the same codegen unit, and
so could bring in arbitrary labels.
As a consequence, you should only use GNU assembler numeric local labels inside inline assembly code.
Defining symbols in assembly code may lead to assembler and/or linker errors due to duplicate symbol
definitions.
Moreover, on x86 when using the default Intel syntax, due to an LLVM bug, you shouldn't use labels
exclusively made of 0 and 1 digits, e.g. 0 , 11 or 101010 , as they may end up being interpreted as binary
values. Using options(att_syntax) will avoid any ambiguity, but that affects the syntax of the entire
asm! block. (See Options, below, for more on options .)
use std::arch::asm;
let mut a = 0;
unsafe {
asm!(
"mov {0}, 10",
"2:",
"sub {0}, 1",
"cmp {0}, 3",
"jle 2f",
"jmp 2b",
"2:",
"add {0}, 2",
out(reg) a
);
}
assert_eq!(a, 5);
This will decrement the {0} register value from 10 to 3, then add 2 and store it in a .
First, that the same number can be used as a label multiple times in the same inline block.
Second, that when a numeric label is used as a reference (as an instruction operand, for example), the
suffixes “b” (“backward”) or ”f” (“forward”) should be added to the numeric label. It will then refer to
the nearest label defined by this number in this direction.
Options
By default, an inline assembly block is treated the same way as an external FFI function call with a custom
calling convention: it may read/write memory, have observable side effects, etc. However, in many cases it
is desirable to give the compiler more information about what the assembly code is actually doing so that497
it
can optimize better.
use std::arch::asm;
Options can be provided as an optional final argument to the asm! macro. We specified three options here:
pure means that the asm code has no observable side effects and that its output depends only on its
inputs. This allows the compiler optimizer to call the inline asm fewer times or even eliminate it
entirely.
nomem means that the asm code does not read or write to memory. By default the compiler will
assume that inline assembly can read or write any memory address that is accessible to it (e.g.
through a pointer passed as an operand, or a global).
nostack means that the asm code does not push any data onto the stack. This allows the compiler
to use optimizations such as the stack red zone on x86-64 to avoid stack pointer adjustments.
These allow the compiler to better optimize code using asm! , for example by eliminating pure asm!
blocks whose outputs are not needed.
See the reference for the full list of available options and their effects.
498
499
Compatibility
The Rust language is fastly evolving, and because of this certain compatibility issues can arise, despite
efforts to ensure forwards-compatibility wherever possible.
Raw identifiers
500
501
Raw identifiers
Rust, like many programming languages, has the concept of "keywords". These identifiers mean something
to the language, and so you cannot use them in places like variable names, function names, and other
places. Raw identifiers let you use keywords where they would not normally be allowed. This is
particularly useful when Rust introduces new keywords, and a library using an older edition of Rust has a
variable or function with the same name as a keyword introduced in a newer edition.
For example, consider a crate foo compiled with the 2015 edition of Rust that exports a function named
try . This keyword is reserved for a new feature in the 2018 edition, so without raw identifiers, we would
have no way to name the function.
fn main() {
foo::try();
}
fn main() {
foo::r#try();
}
502
503
Meta
Some topics aren't exactly relevant to how you program runs but provide you tooling or infrastructure
support which just makes things better for everyone. These topics include:
Documentation: Generate library documentation for users via the included rustdoc .
Playground: Integrate the Rust Playground in your documentation.
504
505
Documentation
Use cargo doc to build documentation in target/doc , cargo doc --open will automatically open it in
your web browser.
Use cargo test to run all tests (including documentation tests), and cargo test --doc to only run
documentation tests.
Doc comments
Doc comments are very useful for big projects that require documentation. When running rustdoc , these
are the comments that get compiled into documentation. They are denoted by a /// , and support
Markdown.
1 #![crate_name = "doc"]
2
3 /// A human being is represented here
4 pub struct Person {
5 /// A person must have a name, no matter how much Juliet may hate it
6 name: String,
7 }
8
9 impl Person {
10 /// Creates a person with the given name.
11 ///
12 /// # Examples
13 ///
14 /// ```
15 /// // You can have rust code between fences inside the comments
16 /// // If you pass --test to `rustdoc`, it will even test it for you!
17 /// use doc::Person;
18 /// let person = Person::new("name");
19 /// ```
20 pub fn new(name: &str) -> Person {
21 Person {
22 name: name.to_string(),
23 }
24 }
25
26 /// Gives a friendly hello!
27 ///
28 /// Says "Hello, [name](Person::name)" to the `Person` it is called on.
29 pub fn hello(&self) {
30 println!("Hello, {}!", self.name);
31 }
32 }
33
34 fn main() {
35 let john = Person::new("John");
36
37 john.hello();
38 }
To run the tests, first build the code as a library, then tell rustdoc where to find the library so it can link it 506
into each doctest program:
Doc attributes
Below are a few examples of the most common #[doc] attributes used with rustdoc .
inline
#[doc(inline)]
pub use bar::Bar;
no_inline
hidden
For documentation, rustdoc is widely used by the community. It's what is used to generate the std library
docs.
See also:
Playground
The Rust Playground is a way to experiment with Rust code through a web interface.
1 fn main() {
2 println!("Hello World!");
3 }
This allows the reader to both run your code sample, but also modify and tweak it. The key here is the
adding of the word editable to your codefence block separated by a comma.
```rust,editable
//...place your code here
```
Additionally, you can add ignore if you want mdbook to skip your code when it builds and tests.
```rust,editable,ignore
//...place your code here
```
#![doc(html_playground_url = "https://play.rust-lang.org/")]
//! ```
//! println!("Hello World");
//! ```
See also: