Streamkeepers’ Physical Habitat Sampling Plan: What Now?
Questions We
Need to Answer SOON
Briefly, there is reason to think that Streamkeepers
should perhaps discontinue its current set of physical-habitat monitoring
(P-hab) parameters, and replace it with another set. However, we haven’t had enough time to
hold this discussion with our technical advisors in time to decide exactly what
new set (if any) to put in place this summer. So for now, our questions are more
limited:
1.
Should we continue
measuring our current set of P-hab parameters this summer, even if there’s a
good chance we’ll be changing sets next year? (Or should we measure them at new sites
where haven’t measured P-hab before?)
2.
Should we explore
alternative P-hab monitoring systems (between now and next summer), and if so,
which ones?
3.
If we don’t monitor
P-hab this summer, should we add an additional water-chem/flow monitoring
session in July to the ones we already do in August and in
mid-Sept—mid-Oct? (The rationale
would be that the volunteers would have less work to do, and the time saved by
not monitoring P-hab could be spent doing additional water quality monitoring in
July, which in some cases might be a critical water-quality
month.)
We need answers to both questions by
mid-June.
Genesis of
Our Current P-hab Suite
Streamkeepers was created primarily to engage volunteers
in gathering useful data about
When Streamkeepers inherited this program in 1999, we
set about revising the monitoring program.
Our technical advisors (mostly local agency folks who were presumably the
primary end-users of the data) wanted us to measure the chemical, biological,
and physical health of streams at targeted sites, just as we had under the
former project, only with more accurate and precise measurements. For water-chemistry and flow
measurements, this was easily accomplished by switching from educational test
kits to electronic meters. For
biological monitoring, we adopted the genus-level ten-metric Benthic Index of
Biological Integrity for the Puget Sound Lowlands, which is backed by extensive
publications and has wide acceptance in scientific and regulatory
circles.
P-hab monitoring was a more difficult matter. We needed a program that would enable
valid evaluation of the physical health of specific reaches of
1.
Many P-hab
assessment systems involved numerous judgment-calls on the part of the
evaluator, and even if adequate precision were possible with trained,
professional technicians (a claim that has been disputed), that same precision
did not seem possible with our volunteers.
(Example: Barbour et al., Rapid Bioassessment Protocols, USEPA
1999; the 1989 Plafkin et al. version; and Hankin and Reeves’ visual habitat
area estimation method.)
2.
Other systems were
more quantitative and involved few or no judgment calls, but these were complex
enough that they seemed beyond the scope of what volunteers could be expected to
do, given the time-requirements for training and execution. (Example:
3.
Programs designed
for volunteers yielded data lacking adequate precision, accuracy, and
meaningfulness. (Example: Adopt-A-Stream Foundation’s Streamkeeper’s Field
Guide.)
We decided to base our P-hab monitoring program on work
done at the
|
Parameter |
Method |
Source |
Equipment |
Frequency |
|
Gradient |
“Pea-shooter” |
|
Stadia rod and sight
level |
1x/year |
|
Cross-section |
Permanent monuments;
year-to year comparisons |
Booth & Comings
1998 |
Monuments, tape,
string, stadia rod |
1x/year |
|
Erosion
and revetment |
Observation &
measurement |
Scholz & Booth
1999; City of |
Tape
measure |
1x/year |
|
Substrate
|
100-pebble
count |
Kondolf 1997; Scholz
and Booth 1999 |
Half-phi
ruler |
1x/year |
|
Pools |
Tally &
measurement of residual depth |
TFW 1999, simplified;
Scholz and Booth 1999; |
Stadia
rod |
1x/year |
|
Large woody debris
(LWD) |
Tally by type and
zone—minimum sizes, but size not measured |
TFW 1999, simplified;
Scholz and Booth 1999; |
Tape measure,
ruler |
1x/year |
|
Canopy closure
percentage |
Single-point |
TFW 1998,
simplified |
Spherical
densiometer |
2x/year (smr &
wtr) |
|
Canopy type
percentages |
Visual estimate of
overstory cover percentages in reach area |
|
None; estimates are
done in 3 percentage classes |
1x/year |
|
Conifer
stems |
Density count of
any-size stems |
|
Reach map, tape,
compass to mark boundaries |
Every 5
years |
It should be noted that at the recommendation of our
technical advisors, our reaches were picked not randomly, but targeted in
consideration of a number of factors, including intuitive representativeness of
the suite of sites for the stream as a whole and of the particular site for its
location in the watershed; adjacent land-use; access; and distance from channel
alterations. So our data can only
properly be interpreted to represent their particular collection
sites.
Draft
Physical Habitat Index (PHI)
After gathering five years’ worth of data, we drafted a
7-metric PHI, using the following parameters calculated using Streamkeeper data
(to see our explanatory paper or scores, see http://www.clallam.net/streamkeepers/html/physical_habitat_index.htm),
and comparing values for these parameters to established reference standards for
healthy streams:
We found our draft PHI scores to correlate with both
B-IBI ratings and biologists’ intuitive P-hab ratings of sites, but the
correlations were only strong at the extremes—cases of extremely good or bad
habitat. In the middle, the
correlations were weak. Some of the
non-correlation relates to reality:
P-hab influences but does not in itself determine biological
integrity. Some of the correlation
could be improved if our PHI were calibrated using standard statistical
techniques. And some further
explanatory power might be gained by adding additional metrics for which we have
data: pebble embeddedness, invasive
aquatic weeds, bank stability class, and cross-section instability. But these tasks would require
considerable time, which we currently don’t have. (We do have some volunteers with
sufficient scientific background to take on these tasks, and one of them might
take on this task if we “pushed it.”)
It could be argued that even if we decide to abandon our current suite of
P-hab protocols, we should complete the development of this PHI, in the attempt
to synthesize five years’ worth of data.
Weaknesses of
Our Current P-hab Monitoring Program
There are a number of weaknesses in our current suite of
P-hab protocols, which might be grounds for abandoning or altering it. The first weakness is our suite’s
uniqueness. After Booth published
his 1999 suggestions, we assumed they would be followed by a spate of
graduate-level research exploring Booth’s parameters and their usefulness in
P-hab assessment, culminating in the development of a PHI. That has turned out only partially to be
the case, and though our PHI was significantly aided by Booth’s students’ work,
none of them attempted to devise a PHI using Booth’s original set of parameters.
Since we aren’t using someone
else’s standard technique, we don’t have other scores to compare ours to, and we
would find greater acceptance of our work if we used a technique that had been
developed, vetted, and published by some prominent arm of government or
academia. For example, the
Washington State Comprehensive Monitoring Strategy for Watershed Health and
Salmon Recovery (Dec. 2002) calls for monitoring of freshwater habitat status
and trends using methods similar to the Environmental Monitoring and Assessment
Program (EMAP) of the U.S. Environmental Protection Agency, which has been
extensively applied in Pacific Northwest streams.
Our second great weakness is the length of our
monitoring reaches. When we began
in 1999, we inherited reaches that had already been established by our
predecessor program. These were
100’ long. Reaches established
later tended to be 200’ or more, but even that length is considerably shorter
than that recommended by scientists for a representative reach length—for
example, EMAP recommends a minimum sample reach length of 150 m. or 40 times the
baseflow wetted width, whichever is longer.
One solution would be to just lengthen the reaches, but
that would involve some complications:
Further
weaknesses in our P-hab protocols are described in the table
below:
|
Parameter/Protocol |
Problems Noted |
Proposed Changes |
|
Pebble count:
Surface sediment sampled annually in half-phi size
classes |
Measures surface armoring
only, not substrate, which is critical for spawning/rearing; and cannot
accurately measure fines < 8 mm |
Create special team to
perform core-sampling in selected cases; also, sample pebbles at regular
intervals along a longer reach, as in EMAP |
|
Large woody debris:
Tallied by minimum-size threshold and channel intrusion
zone |
LWD volume data is needed
to accurately calculate Bed Substrate Stability for our PHI. (We currently estimate this
factor.) |
Measure LWD volume within
the bankfull channel (as in EMAP) |
|
Pools:
Tallied along with residual depth, by minimum-depth threshold
according to stream size |
Residual-pool volume data
is needed to accurately calculate Relative Bed Stability for our PHI. (We currently estimate this
factor.) |
Eliminate pools tally and
add a thalweg-profile component in order to calculate residual-pool area
(as in EMAP) |
|
Cross section:
Measured annually at one spot per site, between permanent
monuments |
Micro-location may impact
the stability or instability of the particular spot; monuments can go
missing |
Eliminate permanent
monuments and measure multiple cross sections at regular intervals along
the monitoring reach (as in EMAP) |
|
Gradient:
Measured annually along a 50-100’ line within the monitoring
reach |
·
The site
selected may poorly reflect overall gradient within the larger
reach ·
Basic
measurement problems |
·
Measure
gradient (& thalweg profile) along entire length of longer reaches (as
in EMAP) ·
Use surveying
techniques |
|
Canopy closure: Measured biannually at a single point within the
reach |
Single point may bias the
data |
Measure canopy closure at
regular intervals along the monitoring reach (as in
EMAP) |
|
Conifer stem count: All
conifer stems of any size are counted within site length &
width |
Sites are too short for
adequate reach characterization (see above). Stems aren’t measured, so resulting
riparian habitat info is coarse & this count isn’t used in our draft
PHI. Also, this is the only current parameter that requires a reach map, a
protocol with its own problems. |
Discontinue; implement a
random-plot procedure assessing species & size by category (as in
EMAP); or implement remote-sensing
analysis |
Broader
Questions to Consider
In order
to decide on what suite of P-hab parameters to measure, we need to pose the
following questions:
1.
Should Streamkeepers do P-hab monitoring at all? If so, why?
The
stated goals of our monitoring include status, trends, red-flags, problem
investigation, and restoration planning and effectiveness monitoring. Are all of those goals still valid, and
should P-hab monitoring be conducted to help support those goals? Do the goals need to be
prioritized?
2.
Where and when should we monitor P-hab, and with what
parameters?
These
questions address overall sampling design.
Our technical advisors have consistently asked us to do targeted rather
than randomized sampling, although some have lately begun to argue for the
latter. For example, Bruce Crawford
of the Statewide Monitoring Oversight Committee is involved in an effort to
devise a statewide WRIA-level randomized monitoring framework that would
designate about 20 randomized sites per WRIA, whereby various entities
(including volunteer groups such as Streamkeepers) could partner to monitor a
full suite of parameters (physical, chemical, and biological) at the framework
sites. It may be that eventually
Streamkeepers could perform both targeted and randomized sampling, perhaps with
our entire suite of parameters—biological, chemical, and physical—in order to
meet the valid goals of both approaches.
3.
How should we monitor those
parameters?
In order
to simplify the myriad of sampling protocols and suites of protocols available,
we’ll break the choices down to a few basic options, with pros and cons listed
for each approach:
A.
Streamkeepers’ current P-hab
suite
We could continue using our present suite of P-hab
protocols, possibly lengthening reaches and making some/all of the modifications
suggested in the table above.
·
Pros:
·
Would require the
least retooling of our program.
·
New data could be
correlated with old data, thus salvaging a five-year data
set.
·
PHI development
could continue, possibly resulting in elimination of parameters with redundancy
or high signal/noise ratio.
·
Cons: Listed above under “Weaknesses.” Also, PHI calibration could lead up a
blind alley if our suite of parameters measured turns out to be
inadequate.
B.
Refined
Some of Booth’s students at UW continued working along
the lines of his 1999-2000 suggestions.
Sossa and Booth (2004) produced a Physical In-Stream Condition Index
(PSCI) that ignores the riparian area and includes just 4 parameters: bank stability and sediment cementation
(categorized qualitatively), and numbers of LWD and pools. This index provides a coarse
discrimination of “low,” “intermediate,” or “high” quality physical in-stream
condition. McBride and Booth (2003)
produced an index that included bank stability, cementation, and LWD, plus
embeddedness and channel complexity (categorized qualitatively), and a
comparison of channel cross-section with watershed size. Each of the 6 metrics uses a 4-point
scale. This index was found to
correlate well with measures of biological integrity and landscape
cover.
·
Pros:
·
These are truly
rapid assessment protocols, designed for assessment at a broad geographic
scale.
·
Multimetric indexes
have been developed, at least in draft form.
·
Cons:
·
Their use requires
a good deal of subjective assessment, which would run into precision problems
with our large volunteer group.
·
Assessments are at
a fairly crude level, and trend analysis would not be very
discriminating.
·
There is no single
set of protocols broadly accepted by a variety of
agencies.
C.
Nationally-
or regionally-developed P-hab suite, such as
EMAP
Bruce Crawford, the statewide Monitoring Oversight
Committee coordinator, is working with state agencies on modifications to EMAP
to improve efficiency and precision, most notably the use of remote-sensing
images to do riparian analysis. We
could coordinate with Bruce’s effort, get trained, and then train a sub-group of
our volunteers in EMAP protocols.
Phil Kaufman of EPA says that we could train volunteers in 2 days, and
that a team of 2 can monitor a reach in about 3.5 hours. If we could train 8-10 volunteers
willing to work 8 days apiece over the summer and fall, we could assess 30-50
reaches per year. These could be at
Streamkeepers’ already-targeted sites, new randomized sites, or a combination of
both.
·
Pros:
·
Data would be
consistent with federal and state databases, and therefore more likely to be
used at those levels.
·
Qualitative
assessments are minimized, and a subset of our volunteers could be trained to
gather high-quality data.
·
Protocols and data
sheets are well developed, and Oregon DEQ has programmed data-entry into
hand-held computers to incorporate data-validation and reduce data-entry time
and error.
·
EPA is working on
calibrated indexes of P-hab quality based on EMAP data.
·
Cons:
·
Local agencies have
little experience with EMAP.
·
Our five-year data
set might end up going for naught.
·
Implementing EMAP
would be a major initiative requiring a good deal of staff time. We would hopefully be able to get
trained and do pilot studies with our volunteers in 2005, and then begin
monitoring in 2006.
D.
Locally-developed P-hab suite
Local natural-resource professionals have developed
their own P-hab monitoring systems.
For example, the watershed assessment done on Siebert Creek in 2004 used
a modified TFW protocol developed by Derek Booth and Chris May, combining
qualitative and quantitative approaches, and including the following
parameters:
·
Categorization of
habitat units (pool, riffle, cascade, glide)
·
Qualitative
assessment of dominant spawning gravel particle size, embeddedness, and bank
stability
·
LWD count, key
pieces, and sizes
·
Pool depths plus
qualitative assessment
·
Fish passage
blockage assessment
·
Photo
documentation
·
Riparian assessment
using aerial photos
·
Other qualitative
observations relevant to habitat quality
When assessing restoration project effectiveness, local
project managers will often add stream-profile surveys using standard surveying
techniques.
·
Pros:
·
Many local
professionals are comfortable with this approach.
·
Cons:
·
A good deal of
qualitative assessment is required, and precision would be a
problem.
·
This assessment
system is not standardized and accepted across a broad variety of agencies, so
data comparability and communication may be a problem.
·
No attempt has been
made to develop a multimetric index using such a system.
·
Formal surveying
would be difficult and time-consuming for our volunteers.
Here’s our
opinion; what’s yours?
Given
our research to this point, we’re leaning toward EMAP as
modified by
Thanks, Ed and
Hannah
[1] See the following two documents:
Scholz, J. G. and D. B. Booth, 1999. Stream habitat assessment
protocols: An evaluation of
urbanizing watersheds in the
Scholz, J. G. and D. B. Booth, 2000. Monitoring urban streams: Strategies and protocols for humid-region lowland systems. Env. Monit. Ass. (in press). Available at the University of Washington Center for Water and Watershed Studies website, http://depts.washington.edu/cuwrm/research/monitoring.pdf.