Christopher Collins
PhD
Canada Research Chair in Linguistic Information Visualization
Associate Professor
Expert in managing linguistic data and understanding its importance to Canada
Areas of expertise
- PhD - Computer Science, Knowledge Media Design University of Toronto, Toronto, Ontario 2010
- MSc - Computer Science University of Toronto, Toronto, Ontario 2004
- BSc - Computer Science and Chemistry Memorial University of Newfoundland, St. John's Newfoundland 2001
Best Paper Honourable Mention: Progressive Learning of Topic Modeling Parameters: A Visual Analytics Framework
Published in IEEE Transactions on Visualization and Computer Graphics Volume: 24 Issue: 1 January 1, 2018Mennatallah El-Assady, Rita Sevastjanova, Fabian Sperrle, Daniel Keim & Christopher Collins
Topic modeling algorithms are widely used to analyze the thematic composition of text corpora but remain difficult to interpret and adjust. Addressing these limitations, we present a modular visual analytics framework, tackling the understandability and adaptability of topic models through a user-driven reinforcement learning process which does not require a deep understanding of the underlying topic modeling algorithms. Given a document corpus, our approach initializes two algorithm configurations based on a parameter space analysis that enhances document separability. We abstract the model complexity in an interactive visual workspace for exploring the automatic matching results of two models, investigating topic summaries, analyzing parameter distributions, and reviewing documents. The main contribution of our work is an iterative decision-making technique in which users provide document-based relevance feedback that allows the framework to converge to a user-endorsed topic distribution. We also report feedback from a two-stage study which shows that our technique results in topic model quality improvements on two independent measures.
Metatation: Annotation as Implicit Interaction to Bridge Close and Distant Reading
Published in ACM Transactions on Computer-Human Interaction Volume 24: Issue 5 November 1, 2017Hrim Mehta, Adam Bradley, Mark Hancock & Christopher Collins
In the domain of literary criticism, many critics practice close reading, annotating by hand while performing a detailed analysis of a single text. Often this process employs the use of external resources to aid analysis. In this article, we present a study and subsequent tool design focused on leveraging a critic’s annotations as implicit interactions for initiating context-specific computational support that automatically searches external resources. We observed 14 poetry critics performing a close reading, revealing a set of cognitive practices supported through free-form annotation that have not previously been discussed in this context. We used guidelines derived from our study to design a tool, Metatation, which uses a pen-and-paper system with a peripheral display to utilize reader annotations as underspecified interactions to augment close reading. By turning paper-based annotations into implicit queries, Metatation provides relevant supplemental information in a just-in-time manner and acts as a bridge between close and distant reading.
Perceptual Biases in Font Size as a Data Encoding Sign In or Purchase to View Full Text 135 Full Text Views Related Articles Mean
Published in IEEE Transactions on Visualization and Computer Graphics Volume: 24, Issue: 8 July 4, 2017Eric Alexander, Chih-Ching Chang, Mariana Shimabukuro, Steven Franconeri, Christopher Collins, Michael Gleicher
Many visualizations, including word clouds, cartographic labels, and word trees, encode data within the sizes of fonts. While font size can be an intuitive dimension for the viewer, using it as an encoding can introduce factors that may bias the perception of the underlying values. Viewers might conflate the size of a word's font with a word's length, the number of letters it contains, or with the larger or smaller heights of particular characters (‘o’ vs. ‘p’ vs. ‘b’). We present a collection of empirical studies showing that such factors-which are irrelevant to the encoded values-can indeed influence comparative judgements of font size, though less than conventional wisdom might suggest. We highlight the largest potential biases, and describe a strategy to mitigate them.
NEREx: Namedâ€Entity Relationship Exploration in Multiâ€Party Conversations
Published in Computer Graphics Forum Volume: 36, Issue: 3 July 1, 2017Mennatallah Elâ€Assady, Rita Sevastjanova, Bela Gipp, Daniel Keim & Christopher Collins
We present NEREx, an interactive visual analytics approach for the exploratory analysis of verbatim conversational transcripts. By revealing different perspectives on multiâ€party conversations, NEREx gives an entry point for the analysis through highâ€level overviews and provides mechanisms to form and verify hypotheses through linked detailâ€views. Using a tailored namedâ€entity extraction, we abstract important entities into ten categories and extract their relations with a distanceâ€restricted entityâ€relationship model. This model complies with the often ungrammatical structure of verbatim transcripts, relating two entities if they are present in the same sentence within a small distance window.
Exploring the Possibilities of Embedding Heterogeneous Data Attributes in Familiar Visualizations
Published in IEEE Transactions on Visualization and Computer Graphics Volume: 23, Page 581-590 January 1, 2017Mona Hosseinkhani Loorak, Charles Perin, Christopher Collins & Sheelagh Carpendale
Heterogeneous multi-dimensional data are now sufficiently common that they can be referred to as ubiquitous. The most frequent approach to visualizing these data has been to propose new visualizations for representing these data. These new solutions are often inventive but tend to be unfamiliar. We take a different approach. We explore the possibility of extending well-known and familiar visualizations through including Heterogeneous Embedded Data Attributes (HEDA) in order to make familiar visualizations more powerful. We demonstrate how HEDA is a generic, interactive visualization component that can extend common visualization techniques while respecting the structure of the familiar layout. HEDA is a tabular visualization building block that enables individuals to visually observe, explore, and query their familiar visualizations through manipulation of embedded multivariate data. We describe the design space of HEDA by exploring its application to familiar visualizations in the D3 gallery. We characterize these familiar visualizations by the extent to which HEDA can facilitate data queries based on attribute reordering.
Optimizing Hierarchical Visualizations with the Minimum Description Length Principle
Published in IEEE Transactions on Visualization and Computer Graphics Volume: 23, Issue: 1 January 1, 2017Rafael Veras & Christopher Collins
In this paper, we examine how the Minimum Description Length (MDL) principle can be used to efficiently select aggregated views of hierarchical datasets that feature a good balance between clutter and information. We present MDL formulae for generating uneven tree cuts tailored to treemap and sunburst diagrams, taking into account the available display space and information content of the data. We present the results of a proof-of-concept implementation. In addition, we demonstrate how such tree cuts can be used to enhance drill-down interaction in hierarchical visualizations by implementing our approach in an existing visualization tool. Validation is done with the feature congestion measure of clutter in views of a subset of the current DMOZ web directory, which contains nearly half-million categories. The results show that MDL views achieve near-constant clutter levels across display resolutions. We also present the results of a crowdsourced user study where participants were asked to find targets in views of DMOZ generated by our approach and a set of baseline aggregation methods. The results suggest that, in some conditions, participants are able to locate targets (in particular, outliers) faster using the proposed approach.
ConToVi: Multi-Party Conversation Exploration using Topic-Space Views
Published in http://vialab.science.uoit.ca/portfolio/contovi Volume: 35, Issue: 3, Page: 431-440 July 4, 2016Mennatallah El-Assady, Valentin Gold, Carmela Acevedo, Christopher Collins and Daniel Keim
We introduce a novel visual analytics approach to analyze speaker behaviour patterns in multi-party conversations. We propose Topic-Space Views to track the movement of speakers across the thematic landscape of a conversation. Our tool is designed to assist political science scholars in exploring the dynamics of a conversation over time to generate and prove hypotheses about speaker interactions and behaviour patterns. Moreover, we introduce a glyph-based representation for each speaker's turn based on linguistic and statistical cues to abstract relevant text features. We present animated views for exploring the general behaviour and interactions of speakers over time and interactive steady visualizations for the detailed analysis of a selection of speakers. Using a visual sedimentation metaphor we enable the analysts to track subtle changes in the flow of a conversation over time while keeping an overview of all past speaker turns. We evaluate our approach on real-world datasets and the results have been insightful to our domain experts.
PhysioEx: Visual Analysis of Physiological Event Streams
Published in Computer Graphics Forum Volume: 35, Issue: 3 July 4, 2016Rishikesan Kamaleswaran, Christopher Collins, Andrew James & Carolyn McGregor
In this work, we introduce a novel visualization technique, the Temporal Intensity Map, which visually integrates data values over time to reveal the frequency, duration, and timing of significant features in streaming data. We combine the Temporal Intensity Map with several coordinated visualizations of detected events in data streams to create PhysioEx, a visual dashboard for multiple heterogeneous data streams. We have applied PhysioEx in a design study in the field of neonatal medicine, to support clinical researchers exploring physiologic data streams. We evaluated our method through consultations with domain experts. Results show that our tool provides deep insight capabilities, supports hypothesis generation, and can be well integrated into the workflow of clinical researchers.
Best Paper Honourable Mention: #FluxFlow: Visual Analysis of Anomalous Information Spreading on Social Media
Published in IEEE Trans. on Visualization and Computer Graphics Volume: 20, Issue: 12 December 31, 2014Jian Zhao, Nan Cao, Zhen Wen, Yale Song, Yu-Ru Lin & Christopher Collins
We present FluxFlow, an interactive visual analysis system for revealing and analyzing anomalous information spreading in social media. Everyday, millions of messages are created, commented, and shared by people on social media websites, such as Twitter and Facebook. This provides valuable data for researchers and practitioners in many application domains, such as marketing, to inform decision-making. Distilling valuable social signals from the huge crowd's messages, however, is challenging, due to the heterogeneous and dynamic crowd behaviours. The challenge is rooted in data analysts' capability of discerning the anomalous information behaviours, such as the spreading of rumours or misinformation, from the rest that are more conventional patterns, such as popular topics and newsworthy events, in a timely fashion. FluxFlow incorporates advanced machine learning algorithms to detect anomalies, and offers a set of novel visualization designs for presenting the detected threads for deeper analysis. We evaluated FluxFlow with real datasets containing the Twitter feeds captured during significant events such as Hurricane Sandy.
DimpVis: Exploring Time-varying Information Visualizations by Direct Manipulation
Published in IEEE Transactions on Visualization and Computer Graphics Brittany Kondo ; Christopher Collins December 31, 2014Brittany Kondo & Christopher Collins
We introduce a new direct manipulation technique, DimpVis, for interacting with visual items in information visualizations to enable exploration of the time dimension. DimpVis is guided by visual hint paths which indicate how a selected data item changes through the time dimension in a visualization. Temporal navigation is controlled by manipulating any data item along its hint path. All other items are updated to reflect the new time. We demonstrate how the DimpVis technique can be designed to directly manipulate position, colour, and size in familiar visualizations such as bar charts and scatter plots, as a means for temporal navigation. We present results from a comparative evaluation, showing that the DimpVis technique was subjectively preferred and quantitatively competitive with the traditional time slider, and significantly faster than small multiples for a variety of tasks.
Book Chapter: Simple Multi-Touch Toolkit
Published in In SurfNet: Designing Digital Surface Applications January 1, 2016Erik Paluka, Zachary Cook, Mark Hancock & Christopher Collins
While multi-touch computing becomes more common, there comes a requirement for students to learn how to create software for multi-touch environments. Although there are many powerful toolkits that exist already, they require a strong programming background and thus become difficult to integrate into fast-paced human-computer interaction (HCI) courses or for non-CS students to use. Researchers at the University of Ontario Institute of Technology (UOIT) and the University of Waterloo (UW) have developed a toolkit with a simplified API called the Simple Multi-Touch Toolkit (SMT).
Â鶹´«Ã½ University Research Excellence Award (Early Career)
Â鶹´«Ã½ University November 10, 2015Â鶹´«Ã½ University's Research Excellence Awards recognize faculty who have achieved national and/or international success and recognition through their research activities and enhanced Â鶹´«Ã½ University's reputation as a research-focused institution.