Variables, Expressions and Statements
One of the most powerful features of a programming language is the ability to manipulate variables. A variable is a name that refers to a value.
Assignment Statements
An assignment statement creates a new variable and gives it a value:
>>> message = 'And now for something completely different'
>>> n = 17
>>> pi = 3.1415926535897932
This example makes three assignments. The first assigns a string to a new variable named message
; the second gives the integer 17 to n
; the third assigns the (approximate) value of π to pi
.
A common way to represent variables on paper is to write the name with an arrow pointing to its value. This kind of figure is called a state diagram because it shows what state each of the variables is in (think of it as the variable’s state of mind). Below is example of a state diagram:
Variable Names
Programmers generally choose names for their variables that are meaningful—they document what the variable is used for.
Variable names can be as long as you like. They can contain both letters and numbers, but they can’t begin with a number. It is legal to use uppercase letters, but it is conventional to use only lower case for variables names.
The underscore character, _
, can appear in a name. It is often used in names with multiple words, such as your_name
or airspeed_of_unladen_swallow
.
If you give a variable an illegal name, you get a syntax error:
>>> 76trombones = 'big parade'
SyntaxError: invalid syntax
>>> more@ = 1000000
SyntaxError: invalid syntax
>>> class = 'Advanced Theoretical Zymurgy'
SyntaxError: invalid syntax
76trombones
is illegal because it begins with a number. more@
is illegal because it contains an illegal character, @
. But what’s wrong with class
?
It turns out that class
is one of Python’s keywords. The interpreter uses keywords to recognize the structure of the program, and they cannot be used as variable names.
Python 3 has these keywords:
False class finally is return
None continue for lambda try
True def from nonlocal while
and del global not with
as elif if or yield
assert else import pass
break except in raise
You don’t have to memorize this list. In most development environments, keywords are displayed in a different color; if you try to use one as a variable name, you’ll know.
Expressions and Statements
An expression is a combination of values, variables, and operators. A value all by itself is considered an expression, and so is a variable, so the following are all legal expressions:
>>> 42
42
>>> n
17
>>> n + 25
42
When you type an expression at the prompt, the interpreter evaluates it, which means that it finds the value of the expression. In this example, n has the value 17 and n + 25 has the value 42.
A statement is a unit of code that has an effect, like creating a variable or displaying a value.
>>> n = 17
>>> print(n)
The first line is an assignment statement that gives a value to n
. The second line is a print statement that displays the value of n
.
When you type a statement, the interpreter executes it, which means that it does whatever the statement says. In general, statements don’t have values.
Script Mode
So far we have run Python in interactive mode, which means that you interact directly with the interpreter. Interactive mode is a good way to get started, but if you are working with more than a few lines of code, it can be clumsy. In Thonny, we have been using the shell pane to execute Python statements in interactive mode.
The alternative is to save code in a file called a script and then run the interpreter in script mode to execute the script. By convention, Python scripts have names that end with .py
. In Thonny, the editor pane (the main part of the window) is for editing scripts. You can type Python statements in this editor pane, save it to your computer with a .py
at the end of its name, and then run it by clicking the green “play” button in the toolbar. (You can also run the script by clicking Run -> Run current script in the top menu.)
Because Python provides both modes, you can test bits of code in interactive mode before you put them in a script. But there are differences between interactive mode and script mode that can be confusing.
For example, if you are using Python as a calculator, you might type
>>> miles = 26.2
>>> miles * 1.61
42.182
The first line assigns a value to miles
, but it has no visible effect. The second line is an expression, so the interpreter evaluates it and displays the result. It turns out that a marathon is about 42 kilometers.
But if you type the same code into a script and run it, you get no output at all. In script mode an expression, all by itself, has no visible effect. Python evaluates the expression, but it doesn’t display the result. To display the result, you need a print statement like this:
miles = 26.2
print(miles * 1.61)
This behavior can be confusing at first. To check your understanding, type the following statements in the Python interpreter and see what they do:
5
x = 5
x + 1
Now put the same statements in a script and run it. What is the output? Modify the script by transforming each expression into a print statement and then run it again.
Important: The above distinction is important. You should actually complete the exercise above before continuing.
Order of Operations
When an expression contains more than one operator, the order of evaluation depends on the order of operations. For mathematical operators, Python follows mathematical convention. The acronym PEMDAS is a useful way to remember the rules:
- Parentheses have the highest precedence and can be used to force an expression to evaluate in the order you want. Since expressions in parentheses are evaluated first,
2 * (3-1)
is4
, and(1+1)**(5-2)
is8
. You can also use parentheses to make an expression easier to read, as in(minute * 100) / 60
, even if it doesn’t change the result. - Exponentiation has the next highest precedence, so
1 + 2**3
is9
, not27
, and2 * 3**2
is18
, not36
. - Multiplication and Division have higher precedence than Addition and Subtraction. So
2*3-1
is5
, not4
, and6+4/2
is8
, not5
.
Operators with the same precedence are evaluated from left to right (except exponentiation). So in the expression degrees / 2 * pi
, the division happens first and the result is multiplied by pi
. To divide by 2 π, you can use parentheses or write degrees / 2 / pi
.
I don’t work very hard to remember the precedence of operators. If I can’t tell by looking at the expression, I use parentheses to make it obvious.
String Operations
In general, you can’t perform mathematical operations on strings, even if the strings look like numbers, so the following are illegal:
'chinese'-'food'
'eggs'/'easy'
'third'*'a charm'
But there are two exceptions, +
and *
.
The +
operator performs string concatenation, which means it joins the strings by linking them end-to-end. For example:
>>> first = 'throat'
>>> second = 'warbler'
>>> first + second
throatwarbler
The *
operator also works on strings; it performs repetition. For example, 'Spam'*3
is 'SpamSpamSpam'
. If one of the values is a string, the other has to be an integer.
This use of +
and *
makes sense by analogy with addition and multiplication. Just as 4*3
is equivalent to 4+4+4
, we expect 'Spam'*3
to be the same as 'Spam'+'Spam'+'Spam'
, and it is. On the other hand, there is a significant way in which string concatenation and repetition are different from integer addition and multiplication. Can you think of a property that addition has that string concatenation does not?
Comments
As programs get bigger and more complicated, they get more difficult to read. Formal languages are dense, and it is often difficult to look at a piece of code and figure out what it is doing, or why.
For this reason, it is a good idea to add notes to your programs to explain in natural language what the program is doing. These notes are called comments, and they start with the #
symbol:
# compute the percentage of the hour that has elapsed
percentage = (minute * 100) / 60
In this case, the comment appears on a line by itself. You can also put comments at the end of a line:
percentage = (minute * 100) / 60 # percentage of an hour
Everything from the #
to the end of the line is ignored—it has no effect on the execution of the program.
Comments are most useful when they document non-obvious features of the code. It is reasonable to assume that the reader can figure out what the code does; it is more useful to explain why.
This comment is redundant with the code and useless:
v = 5 # assign 5 to v
This comment contains useful information that is not in the code:
v = 5 # velocity in meters/second.
Good variable names can reduce the need for comments, but long names can make complex expressions hard to read, so there is a tradeoff.
Debugging
Three kinds of errors can occur in a program: syntax errors, runtime errors, and semantic errors. It is useful to distinguish between them in order to track them down more quickly.
- Syntax Error
- “Syntax” refers to the structure of a program and the rules about that structure. For example, parentheses have to come in matching pairs, so (1 + 2) is legal, but 8) is a syntax error. If there is a syntax error anywhere in your program, Python displays an error message and quits, and you will not be able to run the program. During the first few weeks of your programming career, you might spend a lot of time tracking down syntax errors. As you gain experience, you will make fewer errors and find them faster.
- Runtime Error
- The second type of error is a runtime error, so called because the error does not appear until after the program has started running. These errors are also called exceptions because they usually indicate that something exceptional (and bad) has happened. Runtime errors are rare in the simple programs you will see in the first few chapters, so it might be a while before you encounter one.
- Semantic Error
- The third type of error is “semantic”, which means related to meaning. If there is a semantic error in your program, it will run without generating error messages, but it will not do the right thing. It will do something else. Specifically, it will do what you told it to do. Identifying semantic errors can be tricky because it requires you to work backward by looking at the output of the program and trying to figure out what it is doing.
Glossary
- variable
- A name that refers to a value.
- assignment
- A statement that assigns a value to a variable.
- state diagram
- A graphical representation of a set of variables and the values they refer to.
- keyword
- A reserved word that is used to parse a program; you cannot use keywords like if, def, and while as variable names.
- operand
- One of the values on which an operator operates.
- expression
- A combination of variables, operators, and values that represents a single result.
- evaluate
- To simplify an expression by performing the operations in order to yield a single value.
- statement
- A section of code that represents a command or action. So far, the statements we have seen are assignments and print statements.
- execute
- To run a statement and do what it says.
- interactive mode
- A way of using the Python interpreter by typing code at the prompt.
- script mode
- A way of using the Python interpreter to read code from a script and run it.
- script
- A program stored in a file.
- order of operations
- Rules governing the order in which expressions involving multiple operators and operands are evaluated.
- concatenate
- To join two operands end-to-end.
- comment
- Information in a program that is meant for other programmers (or anyone reading the source code) and has no effect on the execution of the program.
- syntax error
- An error in a program that makes it impossible to parse (and therefore impossible to interpret).
- exception
- An error that is detected while the program is running.
- semantics
- The meaning of a program.
- semantic error
- An error in a program that makes it do something other than what the programmer intended.
Self Checks
Recall that at the end of most readings, a few exercises will be included to give you a chance to apply what you’ve learned. When completing the exercises, it is important to have Thonny open and use it to work through the exercises. If you don’t have Thonny installed yet, go to the Installing Thonny page to install it.
Warning! After each reading, before coming to the corresponding class, you should attempt all of the self check exercises. You may be asked to share your answer/approach in class.
Check 1
- We’ve seen that
n = 42
is legal. What about42 = n
? - How about
x = y = 1
? - In some languages every statement ends with a semi-colon,
;
. What happens if you put a semi-colon at the end of a Python statement? - What if you put a period at the end of a statement?
- In math notation you can multiply
x
andy
like this:x y
. What happens if you try that in Python?
Check 2
Practice using the Python interpreter as a calculator:
- The volume of a sphere with radius r is 4/3 π r3. What is the volume of a sphere with radius 5?
- Suppose the cover price of a book is \$24.95, but bookstores get a 40% discount. Shipping costs \$3 for the first copy and 75 cents for each additional copy. What is the total wholesale cost for 60 copies?
- If I leave my house at 6:52 am and run 1 mile at an easy pace (8:15 per mile), then 3 miles at tempo (7:12 per mile) and 1 mile at easy pace again, what time do I get home for breakfast?
Acknowledgment:
This reading was originally written by Allen Downey in his open source book Think Python 2e. Downey's book is licensed under the GNU Free Documentation License, which allows users to copy, modify, and distribute the book.
This reading was modified by Titus H. Klinge in 2021 and presented above under the same license in order to better serve the students of this course.