It's always helpful to remember that not all BASH operations are created equal. In dicing with configuring an environment I failed to recall the first rule of BASH scripting: Know Your Subshells!
In this exampe n
is always 0. This is because the increment operation is in a subshell, and variables do not propagate back to their parent shells.
n=0
cat file | while read line; do (( n++ )); done
echo $n
Environment variables, as well as the current directory, is only inherited parent-to-child. Changes to a child’s environment are not reflect in the parent. Any time a shell forks, changes done in the forked process are confined to that process and its children.
From the Advanced Bash-Scripting Guide on Subshells:
In general, an external command in a script forks off a subprocess, whereas a Bash builtin does not. For this reason, builtins execute more quickly and use fewer system resources than their external command equivalents.
So, where does BASH create a subshell?
- executing a program or script
- for every invocation in a pipeline
- any time a new shell is started
- for background commands and coprocesses
- in-shell command expansion
- in-shell process substitution
- explicit subshells
So to prevent a subshell being forked, use exec or a builtin.
To wit:
# Executing other programs or scripts
./setmyfoo
foo=bar ./something
# Anywhere in a pipeline in Bash
true | foo=bar | true
# In any command that executes new shells
awk '{ system("foo=bar") }'h
find . -exec bash -c 'foo=bar' \;
# In backgrounded commands and coprocs:
foo=bar &
coproc foo=bar
# In command expansion
true "$(foo=bar)"
# In process substitution
true < <(foo=bar)
# In commands explicitly subshelled with ()
( foo=bar )
Source: Vidar’s Blog » Why Bash is like that: Subshells - contains further examples