Carlos D. Correa and Yu-Hsuan Chan and Kwan-Liu Ma.
A Framework for Uncertainty-Aware Visual Analytics.
In IEEE Symposium on Visual Analytics Science and Technology (VAST), pp. 51--58, 2009.


Links:

Abstract:

Visual analytics has become an important tool for gaining insight on large and complex collections of data. Numerous statistical tools and data transformations, such as projections, binning and clustering, have been coupled with visualization to help analysts understand data better and faster. However, data is inherently uncertain, due to error, noise or unreliable sources. When making decisions based on uncertain data, it is important to quantify and present to the analyst both the aggregated uncertainty of the results and the impact of the sources of that uncertainty. In this paper, we present a new framework to support uncertainty in the visual analytics process, through statistic methods such as uncertainty modeling, propagation and aggregation. We show that data transformations, such as regression, principal component analysis and k-means clustering, can be adapted to account for uncertainty. This framework leads to better visualizations that improve the decision-making process and help analysts gain insight on the analytic process itself.

Bibtex:

@InProceedings{  correa:2009:UAVA,
  author = 	 {Carlos D. Correa and Yu-Hsuan Chan and Kwan-Liu Ma},
  title = 	 {A Framework for Uncertainty-Aware Visual Analytics},
  booktitle =    {{IEEE} Symposium on Visual Analytics Science and
                  Technology (VAST)},
  pages = 	 {51--58},
  year = 	 {2009},
}

Images:

References:

S. Barlowe, T. Zhang, Y. Liu, J. Yang, and D. Jacobs. Multivariate visual explanation for high dimensional datasets. pages 147-154, Oct. 2008.
P. Berkhin. Survey of clustering data mining techniques. Technical report, Accrue Software, San Jose, CA, 2002.
G. Box and N. Draper. Empirical Model-Building and Response Sur- faces. John Wiley & Sons, 1987.
J. Carroll and P. Arabie. Multidimensional scaling. Annual Review of Psychology, 31:607-649, 1980.
K. Chan, A. Saltelli, and S. Tarantola. Sensitivity analysis of model output: variance-based methods make the difference. In WSC '97: Proceedings of the 29th conference on Winter simulation, pages 261- 268, 1997.
M. Chau, R. Cheng, B. Kao, and J. Ng. Uncertain data mining: An example in clustering location data. In Proc. of the 10th Pacific- Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2006), pages 199-204, 2006.
G. Cormode and A. McGregor. Approximation algorithms for clus- tering uncertain data. In PODS '08: Proceedings of the twenty- seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pages 191-200, New York, NY, USA, 2008. ACM.
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Sta- tistical Society. Series B (Methodological), 39(1):1-38, 1977.
H. Dolfing. A visual analytics framework for feature and classifier engineering. Master's thesis, University of Konstanz, 2007.
N. R. Draper and H. Smith. Applied Regression Analysis (Wiley Series in Probability and Statistics). John Wiley & Sons Inc, 2 sub edition, 1998.
H. Frey and S. Patil. Identification and review of sensitivity analysis methods. Risk Analysis, 22(3):553-578, 2002. [12] J. Y. Halpern. Reasoning about Uncertainty. The MIT Press, October 2003.
D. J. Harrison and D. L. Rubinfeld. Hedonic housing prices and the demand for clean air. Journal of Environmental Economics and Man- agement, 5(1):81-102, March 1978.
T. Hastie and R. Tibshirani. Generalized Additive Models. Chapman and Hall, 1990.
F. O. Hoffman and J. S. Hammonds. Propagation of uncertainty in risk assessments: The need to distinguish between uncertainty due to lack of knowledge and uncertainty due to variability. Risk Analysis, 14(5):707-712, 1994.
G. Hunter and M. Goodchild. Managing uncertainty in spatial databases: Putting theory into practice. Journal of Urban and Re- gional Information Systems Association, 5(2):55 -62, 1993.
D. A. Keim, F. Mansmann, J. Schneidewind, and H. Ziegler. Chal- lenges in visual data analysis. In IV '06: Proceedings of the confer- ence on Information Visualization, pages 9-16, 2006.
D. Kurowicka and R. Cooke. Uncertainty Analysis with High Dimen- sional Dependence Modeling. Wiley, 2006.
J. B. MacQueen. Some methods for classification and analysis of mul- tivariate observations. In L. M. L. Cam and J. Neyman, editors, Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Proba- bility, volume 1, pages 281-297. University of California Press, 1967.
W. K. Ngai, B. Kao, C. K. Chui, R. Cheng, M. Chau, and K. Yip. Efficient clustering of uncertain data. pages 436-445, Dec. 2006.
A. Pang, C. M. Wittenbrink, and S. K. Lodha. Approaches to uncer- tainty visualization. The Visual Computer, 13(8):370-390, 1997.
B. Pham and R. Brown. Visualisation of fuzzy systems: requirements, techniques and framework. Future Gener. Comput. Syst., 21(7):1199- 1212, 2005.
M. M. Putko, P. A. Newman, A. C. T. Iii, and L. L. Green. Approach for uncertainty propagation and robust design in cfd using sensitivity derivatives. In AIAA 15th Computational Fluid Dynamics Conference, pages 2001-2528, 2001.
C. R.M. and V. N. J.M. Generalized graphical methods for uncertainty and sensitivity analysis. Bashkir Ecological Journal, (Special Issue), 1(8):54-57, 2000. J. Shlens. A tutorial on principal component analysis, December 2005.
Y. Tanaka. Recent advance in sensitivity analysis in multivariate sta- tistical methods. Journal of the Japanese Society of Computational Statistics, 7(1):1-25, 1994.
B. N. Taylor and C. E. Kuyatt. Guidelines for evaluating and express- ing the uncertainty of NIST measurement results. Technical report, NIST Tecnical Note 1297, 1994.
S. Thompson. Sampling. 1992.
J. Thomson, E. Hetzler, A. Maceachren, M. Gahegan, and M. Pavel. A typology for visualizing uncertainty. In R. F. Erbacher, J. C. Roberts, M. T. Gr{\"o}hn, and K. B{\"o}rner, editors, Visualization and Data Analysis 2005. Proceedings of the SPIE, Volume 5669, pages 146-157, March 2005.
V. \v{S}midl and A. Quinn. On bayesian principal component analysis. Comput. Stat. Data Anal., 51(9):4101-4123, 2007.
N. Wiener. The homogeneous chaos. American Journal of Mathemat- ics, 60(4):897-936, 1938.
Y. Yamanishi and Y. Tanaka. Sensitivity analysis in functional prin- cipal component analysis. Computational Statistics, 20(2):311-326, 2005.
D. Yang, E. A. Rundensteiner, and M. O. Ward. Analysis guided visual exploration of multivariate data. In Visual Analytics Science and Tech- nology, 2007. VAST 2007. IEEE Symposium on, pages 83-90, 2007.
Y. Yao. Interval based uncertain reasoning. Fuzzy Information Pro- cessing Society, 2000. NAFIPS. 19th International Conference of the North American, pages 363-367, 2000.
T. Zuk and M. S. T. Carpendale. Visualization of uncertainty and reasoning. In Smart Graphics, pages 164-177, 2007.