Office of Operations
21st Century Operations Using 21st Century Technologies

Approaches to Presenting External Factors with Operations Performance Measures

APPENDIX. IDENTIFYING AND PRESENTING EXTERNAL FACTORS

This Appendix documents a state-of-the-practice review and statistical analysis conducted to identify key external factors that influence transportation system performance measures. The Appendix also presents key principles for information and visualization design.

STATE-OF-THE-PRACTICE REVIEW

Table 5 summarizes a sample of agencies that collect performance measures and have in recent performance reports or web based dashboards reported external factors broadly grouped into the categories listed above.  While there are more examples available, this selection is highlighted because they represent more complete sets of measures or attributes that are readily available and produced in a single document or dashboard.  Private sector freight and airline companies also plan for external factors however there is limited detailed information on the process and procedures or the measures they actually use documented in the literature.

Washington State DOT has published the Gray Notebook quarterly since 2001.  This document records selected performance measures from across the State as well as local areas of interest.  In 2010, six statewide transportation policy goals were reaffirmed in statute to guide the planning for operation and performance of and investment in the state's transportation system.  These six goals were in the areas of Preservation, Safety, Mobility, Environment, Stewardship, and Economic Vitality.  Biennial Transportation Attainment Reports along with the quarterly Gray Notebook assess progress toward the goals and contribute to the overall performance of the transportation system.

Florida DOT's Planning Division tracks transportation-related "Trends" described as information that could pose threats to successful implementation of the Florida Transportation Plan. This information can be used by Florida's decisionmakers, transportation professionals and the interested public to assist in understanding transportation-related issues and making wise and informed decisions.  Travel demand is measured with vehicle miles travelled and freight volume.  Other external factors presented include population, transportation funding, tourism, fatalities, housing, energy consumption, and mode of travel.

Table 5. Summary Table of Factors Being Recorded by Different Agencies.
Agency Document Travel Demand Indicators Economic Indicators Population and Housing Weather Safety Year
Washington State DOT Gray Notebook & Attainment Report Yes Yes No No Yes 2016
Florida DOT Fast Facts Yes Yes Yes No Yes 2016
Florida DOT Trends \ Dashboard Yes Yes Yes No Yes 2017
Metropolitan Transportation Commission Vital Signs Yes Yes Yes No Yes 2017
Chicago Metropolitan Agency for Planning Regional Economic Indicators Website No Yes Yes No No 2017
South Carolina DOT Performance Dashboard Yes Yes No No Yes 2017
Southern California Association of Governments Performance Measures Appendix to Regional Transportation Plan Yes Yes Yes No Yes 2016
Delaware Valley Regional Planning Commission Comprehensive Economic Development Strategy Performance Measure Website Yes Yes Yes No No 2016

The Metropolitan Transportation Commission (MTC) is the transportation planning and coordinating agency for the nine-county San Francisco Bay area. MTC developed Vital Signs, an interactive website that tracks factors related to transportation land use, the environment, the economy, and social equity.  Travel demand is measured with daily miles traveled and the site also tracks other factors such as population, mode of travel, unemployment, fatalities, home prices, income, and economic output (regional domestic product).

The Chicago Metropolitan Agency for Planning (CMAP) is the regional planning organization for the Northeastern Illinois counties of Cook, DuPage, Kane, Kendall, Lake, McHenry, and Will.  As a part of their comprehensive regional plan the agency develops coordinated strategies that help address issues related to transportation, housing, economic development, and other quality of life issues.  The CMAP website has multiple pages of statistics that track or provide a recent history of factor areas such as household income, unemployment, education, workforce participation, congestion, and freight.

South Carolina DOT produces a performance dashboard that displays statistics of different measures in the areas of economy, mobility, preservation, safety, and strategic planning.  Specific measures include, freight reliability and delay, interstate reliability and delay, fatalities and serious injuries, bicycle and pedestrian fatalities.

Southern California Association of Governments published a list of performance measures as an appendix to their Regional Transportation Plan/Sustainable Communities Strategy.  The Plan is a long-range visioning plan that balances future mobility and housing needs with economic, environmental and public health goals.  As a part of the vision, performance measures are identified as well as the measure definition and source.

The greater Philadelphia area MPO, Delaware Valley Regional Planning Commission, as a part of their Comprehensive Economic Development Strategy, tracks performance measures in the areas of unemployment, income, population, construction, mode of travel, vehicle miles travelled, air quality, tourism, housing affordability, job growth, regional domestic product, and volume and tonnage of freight.

STATISTICAL ANALYSIS OF POSSIBLE EXTERNAL FACTORS

To estimate the effect of external factors on highway performance measures both spatial and temporal dependencies need to be addressed accordingly. External factors such as social-economic changes or weather may be subject to spatial or geographical limitations. The temporal relationship between the change in an external factor and the manifestation of its effect in highway performance measures, as well as the persistence or duration of the effect, is another aspect. External factors may affect highway performance immediately, or may only have an effect after a lag time or after a period of persistent stimulus. Such a relationship is referred as dynamic relationship. The researchers used several statistical tools to address the multivariate dynamic process to:

  • Identify the external factors that have temporal and spatial impact on the performance measures.
  • Identify the factors that have lagged effects.
  • Quantify the relationships.

Multivariate methods for time series analyses such as vector autoregressive and state space models are widely used in econometrics. In addition to describing the time-varying response of dependent highway performance variables, these techniques enable a researcher to quantify the time-dependent effects of exogenous (input) factors on the endogenous (output) variables. Endogenous variables are the processes that are determined inside of the system of interest while the exogenous (or independent) variables are determined outside of the system. For example, the travel time index across the 51 metropolitan statistical areas is the multivariate endogenous factors, while the monthly traffic volume and gross domestic project of these areas are exogenous factors.

The relationship between the performance measures in this study and external factors are formulated in the following way:

Vector form:

(1)This equation shows a vector form of the equation used to evaluate the external factors.

Matrix form:

(2)This equation shows a matrix form of the equation used to evaluate the external factors.

Where,

  • Y subscript PM t is the performance measures vector on date t.
  • PM is the performance measure index (3 in total).
  • X subscript K t is the matrix of K external factors on date t.
  • X subscript K t-1 is the matrix of K external factors on date t-1.
  • MSA is the metropolitan statistical area index (51 in total).
  • Phi superscript 1 is the matrix of autoregressive coefficients where 1 refers to the number of past values (1 for one month, 2 for two months, etc.)
  • B superscript 0 is the matrix of coefficients of external factors whose present values influence the performance measures.
  • B superscript 1 is the matrix of coefficients of external factors whose past values influence the performance measures. These factors are referred as leading indicators.
  • E subscript t is the vector of unobserved error term.

Modeling the multivariate dynamic models can become computationally costly. Therefore, the researchers applied several multivariate tools to make the process relatively easier to execute. The steps taken to conduct the statistical analysis are described in Figure 10. Researchers used PCA to reduce the data dimensionality.(1) After reducing the multivariate data researchers have conducted Granger causality test to identify the most influential external factors affecting the performance measures.(2) This analysis was conducted for each performance measure individually. As the result of this process the researchers obtained the preliminary list of the influential external factors. Each external factor's present and lagged effects were examined as well.

This figure shows a graphic that shows that multiple external factors were analyzed against three performance measures: travel time index, planning time index, and congested hours.

Figure 10. Chart. Statistical Analysis Methodology.

To identify the final list of the influential external factors, the researchers selected the variables that were found to be influential for at least two performance measures. After identifying the list of five influential factors, authors conducted a Dynamic Panel Data (DPD) analysis of the three performance measures.(3)

Figure 11 shows the performance measures in this study across the three MSAs: Austin, Baltimore and Chicago. The time series plots indicate similarities among the same performance measures across the MSAs while the correlation plots indicate that the performance measures of different MSAs are correlated.

To observe the trend and seasonality, the time series were decomposed into the unobserved components using the state space models as shown in Figure 12. Note that the example only shows Austin data.

This figure shows multiple trend lines and correlation plots for three performance measures (travel time index (top), planning time index (middle), and congested hours (bottom)) in three cities: Austin, Baltimore, and Chicago.

Figure 11. Chart. Time Series and Scatter Plots of Performance Measures.

This figure shows seasonal trend and decomposition charts for three performance measures (travel time index (top), planning time index (middle), and congested hours (bottom)) in Austin.

Figure 12. Chart. Trend and Seasonality Decomposition of Performance Measures (Austin).

Selection of Most Influential External Factors

To select the influential factors, the dimension of multivariate data was reduced using the principal components analysis. Granger causality method was later applied to identify the most influential factors and their lagged effects on each performance measure.

PCA was used to reduce the dimension of the performance measures and the external factors. As shown in Figure 13, most of the variability in performance measures across the 51 MSAs are explained by the first and second PCA. Looking at the time series plots of the variances we can observe that the series are marginally stationary.

This figure shows principal component analysis for three performance measures (travel time index (top-left), planning time index (top-right), and congested hours (bottom)) in Austin.

Figure 13. Chart. Principal Component Analysis of Performance Measures.

The PCA of the external factors were computed using the natural logarithms to normalize the data. For the majority of the external factors, the first PCA is able to account for most of the variability among the external factors (Figure 14).

This figure shows principal component analysis for Gross Metropolitan Product.

Figure 14. Graphic. PCA for GMP.

For some other variables such as MADT, more than one PCA was found to be important to explain the correlation among the multivariate series (Figure 15). However, when inspecting the first four PCA's, it can be observed that the first PCA is a stationary series (i.e., the series oscillates around the mean). As the PCA order increases the series becomes less stationary and more stochastic thus indicating that the series can be reduced to one PCA only. As the conclusion, only the first order PCA of all variables was used for the empirical analysis.

This figure shows principal component analysis for Monthly Average Daily Traffic.

Figure 15. Chart. PCA Plots of MADT.

Granger Causality was applied to identify the most significant influential factors. The effects of both current and lagged effects (up to three lags) of the external factors are examined using the Granger test. The results are shown in Table 6. In this table only the marginally significant (p − value < 0.5) results have been reported. As it can be observed among the most significant external factors, the variables MADT, economic conditions index, number of employed, fuel price index, rental vacancy rate and total building permits appear to influence all three performance measures significantly. Leading effects indicate that the effect takes place after the indicated number of months, e.g., any changes in the economic condition index of the MSA will affect the travel time after one to three months. The descriptive statistics of the selected variables are also shown in Table 6.

Table 6. List of Selected External Factors.
Travel Time Index Planning Time Index Congested Hours
External Factor Leading Effects Granger Coefficient P-Value External Factor Leading Effects Granger Coefficient P-Value External Factor Leading Effects Granger Coefficient P-Value
MADT* 0 2.61 0.11 MADT 0 5.28 0.03 MADT 0 7.32 0.01
Economic Conditions Index* 1 – 3 months 6.72 0.00 Economic Conditions Index 1 – 3 months 3.35 0.03 Economic Conditions Index 1 – 3 months 3.19 0.04
Real Estate GDP 0 3.35 0.08
Number of Unemployed 0 3.05 0.09
Number of Employed* 0 2.41 0.13 Number of Employed 0 1.48 0.23 Number of Employed 0 2.38 0.13
Fuel Price Index* 1 – 3 months 2.15 0.11 Fuel Price Index 1 – 3 months 2.37 0.09 Fuel Price Index 1 – 3 months 4.64 0.02
Rental Vacancy Rate* 0 2.56 0.12 Rental Vacancy Rate 0 2.36 0.09 Rental Vacancy Rate 0 6.58 0.01
Home owner Rate 0 3.03 0.09 Home owner Rate 2 13.32 0.00
Rent Price 2 8.08 0.00
Total Building Permits* 0 – 1 month 6.66 0.01 Total Building Permits 0 – 1 month 1.07 0.30 Total Building Permits 0 – 1 month 3.53 0.06
Single-Family BP 0 5.89 0.02
Population Change 2 7.55 0.00
Average Temperature 3 1.67 0.19 Average Temperature 0 5.36 0.03

* External Factors selected for the analysis. Note: only significant factors are included in the table.

The results of the causality analysis indicate that the variables MADT, economic conditions index, number of employed, fuel price index, rental vacancy rate and total building permits are the most influential external factors. The economic conditions and fuel price indices have a leading effect (1 to 3 months) on the performance measures.

PRESENTATION AND CHART BEST PRACTICES

When creating visuals, specifically charts, graphs, and dashboards, care should be given to not only what data and information is displayed to the reader, but also how the chart visually appears. Giving thought to the layout, design, coloring, and formatting of a chart will help to effectively communicate the desired message.

The following principles should be followed to create visualizations that look professional, convey your particular message, and engage your identified audience.

Understand How People Process Information

Creating good visualizations requires that one understands how humans generally process and interpret information, on both a conscious and subconscious level, and then leverages this knowledge into a graphic.

First, we naturally approach visualizations with one of two primary goals: to explain or answer specific questions about something or to explore and glean valuable insights thereby increasing understanding.(4) In the latter, the viewer creates his own questions to answer based on the information seen. These two approaches are directly related to the type of audience and the intended message discussed earlier.

Second, how we approach visualizations (what we see) is heavily influenced by memory. Our long-term memory makes us expect a visualization to work a certain way (e.g., the lowest value to be zero and each increment equally spaced). We expect things to flow from left to right or more important elements to be bigger, darker, or otherwise distinguished. Separately, our working memory continually breaks information into small chunks (usually about three at a time) to process.(5) Any more information than this at one time can lead to information overload and a loss or confusion of the message. In short, keep it simple.

Third, our brains are trained to recognize patterns and differences. Psychologists have created two primary theories for how we do this. First are those things that catch our eye or ‘Preattentive attributes'.(6,7) These are subtle differences in patterns, colors, and position that the eye can perceive in about 10 milliseconds (see Figure 16). These attributes help our brains form patterns and organize information, leading to the second theory. The Gestalt Principle of grouping describes how our brains organize and describe information in groups similar to one another.(8) This principle also allows our brains to highlight important information and weed out noise that may distract from the primary message. Figure 17 illustrates the most common Gestalt principles.

This figure shows various pre-attentive attributes of visual perception, including the main categories of form, color, position, and motion.
Source: Few 2012(9)

Figure 16. Graphic. Pre-attentive Attributes of Visual Perception.

This figure shows the Gestalt principles of grouping, including proximity, similarity, enclosure, closure, continuity, and connection.
Source: http://www.fusioncharts.com/whitepapers/downloads/Principles-of-Data-Visualization.pdf

Figure 17. Graphic. Gestalt Principles of Grouping.

Figure 18 illustrates how these abstract principles operate in a chart.(8) First, note how the primary points of interest are colored dots against the grey of the full data sets for three data series and each series uses a different color (color, similarity, and intensity). Second, each dot utilizes size to convey information about that point (size). The dots' positions also give information for the series, drawing the eye to variances (the large yellow dot above the grey line) to groupings or clusters (proximity and spatial grouping). This chart type effectively uses colors, spatial orientation, size, proximity, enclosure, and similarity to effectively convey a vast amount of information, even though you fundamentally don't know much about what the data represent.

This figure shows an example of pre-attentive attributes and Gestalt principles being combined into a single chart display, and several attributes are highlighted, including grouping/proximity, size/position, similarity, and color/intensity.
Source: http://www.fusioncharts.com/whitepapers/downloads/Principles-of-Data-Visualization.pdf

Figure 18: Graphic. Example of Preattentive Attributes and Gestalt Principles in Action.

Keeping these three principles in mind will not only help you design engaging visualizations, but will also ensure that your message is properly conveyed.

Choose the Right Graphic

The first and often most overlooked step in creating a visualization is to decide what type of graphic should be used. While in the case of external factors your options may be limited, there are several different types of graphics that can be used depending upon your intended use and message. Marketing professionals have understood this for years and offer recommendations based on what type of message and analysis is being made.(9) The following bullets present suggestions of chart types based on intent:

  • Value Comparisons: Column, bar, circular area line, scatter plot, bullet.
  • Composition: Pie, stacked bar, stacked column, area, waterfall.
  • Distributions: Scatter plot, line, column, bar.
  • Trend Analysis: Line, dual-axis line, column.
  • Relationships: Scatter, bubble, line, heat maps, bump.

Note that this list is not exhaustive; there are several other types of charts that may also be used depending upon what information and message is desired. Avoid using pie charts, if possible, as they do not adequately allow the reader to comprehend the size differences if more than 3 or 4 categories are represented.

Depending on the audience and intended use, other media such as graphic art, video, or infographics may also be a suitable choice. Regardless of chart type, making the right selection will likely be the biggest factor in increasing understandability and decreasing the chance of an unclear or mixed message.

Do More with Less: Lines and Colors

Before discussing the details of the data itself, addressing the graphic from a visual standpoint will help guide its creation as data are included. Note that the process of building a chart is iterative; there is a constant need to adjust the focus on the data and design to create a graphic that clearly conveys the message to the target audience.

Lines

With all charts, use as few lines and elements as possible. If nothing else is followed, this one guiding principle should be paramount over any other. Default excel charts add superfluous lines, shapes, and visual noise that distract from the intended message. Carefully consider what lines actually are necessary and give meaning. If a line or element does not give specific meaning to the chart, eliminate it. This can include:

  • Borders.
  • 3D Char effects.
  • Unnecessary tick marks.
  • Gridlines.
  • Trend lines (unless specifically needed).
  • Unnecessary axes that lack value.
  • Too many labels or legends.
Color

Using color is integral to creating engaging visualizations. Color can easily convey where data trends improved, worsened, or stayed neutral; can signify direction; or can classify groups and categories. However, there are a few simple concepts to ensure the use of color is maximized in a visualization.

  1. Use colors people can see. Be sure to use colors and color pairings that allow color-blind people to still see differences. Use specialized color-blind palates and pairing, avoiding red and green combinations for red and blue. Additionally, think about how colors may appear if printed with a black and white printer.

    Use complementary colors. Avoid using color combinations that are difficult to distinguish to the eye. Instead, use colors that are opposite from each other on the color wheel (except red and green). Figure 19 displays a color wheel one can use to find complementary colors. Simply choose an initial color and match it with a color on the opposite side. Use medium intensity colors for the base color, reserving sharper and lighter hues to highlight relevant or irrelevant information.(10) Use diverging or dual-ordered palettes to visualize a range or good to bad scales (or other extremes). Figure 19 illustrates complementary colors, while Figure 20 exemplifies dual-ordered colors.

    This figure shows a color wheel, which has 108 colors arranged in a circular pattern that corresponds to complementary colors.
    Source: https://www.doobert.com/complementary-colors/

    Figure 19. Graphic. Sample Complementary Color Wheel.

    This figure shows ordered and dual-order color palettes, which are used to visualize a range of good to bad scales.
    Source: Few 2012(9)

    Figure 20. Graphic. Ordered and Dual-Ordered Color Palettes.

  2. Use colors in a theme. Display the same type of information with the same color. This uses the preattentive attribute to allow the brain to quickly associate what different data are without needing to refer to a legend. Use ordered color palettes (color from light to dark) of the same color to show intensity.

    Also, try to match colors in charts to other color shades in the document or website. If there is a particular palette used in the logo or theme, continuing that palette will keep the focus on the visualization's message.

  3. When in doubt, grey it out. The use of grey can be powerful in pointing something out in the data. Broadly, grey should be used for axis lines, tick marks, and other necessary chart markers. Grey can also be used to make a statement: make an entire chart different shades of grey with one or two elements colored. This will make those elements pop and draw the eye.

Chart Elements, Legends, and Labels

Chart elements are anything other than a legend or data/axis labels that are included in a chart. Chart elements may, for example, include equations, explanatory text, arrows, or other marks. In general, a similar rule applies as for lines: less is better, unless necessary. This rule also applies for legends and labels: every chart should have these, but eliminate unnecessary text. Keeping your chart focused, well organized, and clean will keep the reader on task and your message from being lost.(10, 11)

Axes

Every chart should have both axes labeled with equally-spaced tick marks. Axis lines and major gridlines (if necessary for clarity) should appear in grey, allowing the data to pop. The vertical axis should always start at zero, with both axes maintaining appropriate aspect ratios. Distortion in width and height can make trends in the data appear more or less severe.

In dual axis charts (with a set of bars and a line both meaning different things), extra consideration should be given to add clarity as to which data refer with which axis.

Legends

Every chart, no matter how intuitive, should have a legend of some kind. Legends can be free-floating in an open area or as labels directly adjacent to the lines or bars. Either way, it should be clear what each line or bar means and to what they correspond.

Labels

Data in charts should have labels, either directly next to the points or in a data table below the chart that corresponds with the information. Be careful not to overwhelm the reader with information; sometimes, it may be better to only include labels for relevant or important data points, depending upon your message.

Titles

All charts should have a title. Titles should be descriptive, succinct, and convey some or all of the message the reader should obtain.

Data

Before creating a chart, be sure to sort the data in ascending or descending order, unless the order of the category matters. For time series data, ensure equal spacing, even if intermittent time periods are included (or consider excluding those periods).

Consider seasonally adjusting data in order to make comparisons at intervals of less than one year. Without seasonal adjustment, only annual comparisons are appropriate, hindering the reader from understanding changes in the data.

Round data labels and figures to the nearest significant figure. If in millions or billions, consider shortening or adding an "M" or "B" after a two-digit number.

If data series are being stacked on the same graph, be sure to align each series with the other. The one exception to this is if there is a noted lead or lag that is part of the intended message. If so, this should be clearly and overtly displayed.

REFERENCES FOR APPENDIX

  1. Peña D. and A. Poncela. 2006. Dimension Reduction in Multivariate Time Series: Chapter in Advances in Distribution Theory, Order Statistics, and Inference. Springer 2006.
  2. Granger C. W. J. 1991. Causality, cointegration and control. Journal of Economic Dynamics and Control. Vol. 12, 1991, pp. 551-559.
  3. Arellano M. and S. Bond. 1991. Some tests of specification for Panel Data. Review of Economic Studies, Vol. 58, pp. 277-297.
  4. Fusion Charts. Principles of Data Visualization – What We See in a Visual. http://www.fusioncharts.com/whitepapers/downloads/Principles-of-Data-Visualization.pdf. Accessed September 4, 2017.
  5. Ware, C. Information Visualization, 3rd Edition. Morgan Kaufmann, 2012.
  6. Preattentive Visual Properties and How to Use Them in Information Visualization. Interaction Design Foundation, 2016. https://www.interaction-design.org/literature/article/preattentive-visual-properties-and-how-to-use-them-in-information-visualization. Accessed September 4, 2017.
  7. Goldstein, B. Perceiving Objects and Scenes: The Gestalt Approach to Object Perception. Sensation and Perception, 8th Edition. Cengage Learning, 2009.
  8. Oetting, J. Data Visualization 101: How to Choose the Right Chart or Graph for Your Data. HubSpot Marketing, 2016. https://blog.hubspot.com/marketing/data-visualization-choosing-chart. Accessed September 7, 2017.
  9. Few, Stephen. Show Me the Numbers: Designing Tables and Graphs to Enlighten. Analytics Press, 2012.
  10. Tufte, E. The Visual Display of Quantitative Information, 2nd Edition. Graphic Press, 2001.
  11. Peltier, J. Ten Chart Design Principles: Guest Post. Peltier Technical Services, Inc. 2008. https://peltiertech.com/ten-chart-design-principles-guest-post/. Accessed September 5, 2017.

You may need the Adobe® Reader® to view the PDFs on this page.

Office of Operations