entice all of the hype as of late inside knowledge science, however I’d argue they’re each secondary to a extra essential—and often-ignored—part of the sector.
When coping with knowledge, there are two important steps:
- Processing and analyzing the information to extract significant insights.
- Conveying these insights to others.
The second level is essential and sometimes neglected. The world’s most superior algorithm or useful perception is ineffective if nobody can perceive it. As an information scientist, you will need to study to convey your insights to others. There may be a couple of motive for this, with the obvious one being that if the proper folks perceive the information, the world at massive will profit. Nonetheless, there may be one other equally essential motive: It’s usually in describing our findings to others that we uncover errors, extra profound information, or additional areas for exploration.
On this article, we’ll study a robust and efficient software which may help obtain the second step above: knowledge visualization. That is the primary in a collection of articles that may take absolute newbies deep into the realm of knowledge visualization. This primary article is basic and lightweight, supposed as an introduction to the sector as a complete. In later articles, I’ll get into the extra technical features, ultimately concluding by instructing you how you can construct your individual knowledge visualizations.
With that information, you’ll be armed to sort out your knowledge in new, thrilling methods.
“The best worth of an image is when it forces us to note what we by no means anticipated to see.” –John Tukey
What Counts as a Knowledge Visualization?
Many individuals view knowledge visualization by means of a restricted lens, solely classifying customary graphs, similar to bar charts, line charts, and the like, as true knowledge visualizations. Seen from this angle, knowledge visualization didn’t materialize till the center of the 18th century. (We’ll see some examples under.)
Nonetheless, we might do effectively to broaden our minds. Visible transformations of knowledge are in no way restricted to our conventional concepts. They’ve been round for hundreds of years. For instance, right here is the Imago Mundi [1], the oldest identified map on the planet, found as a relic of the traditional metropolis of Babylon:
This map locations Babylon on the heart and was possible an especially useful gizmo for visualizing what we now formally name geospatial knowledge. It is among the world’s earliest knowledge visualizations.
There are a plethora of comparable figures and pictures from numerous historic civilizations—cave work, calendars, stone carvings, even Egyptian hieroglyphics—these are all successfully visible representations of knowledge that had been obscure of their preliminary kind. Viewing these examples as knowledge visualizations leads us to an essential precept:
At its core, knowledge visualization is nothing greater than taking some knowledge—be it numerical, textual, or in any other case—and making use of a change to symbolize it visually.
This foundational precept results in a number of associated subjects primarily involving the best strategies to conduct these transformations, the place efficient loosely interprets to “trustworthy, simple to grasp, and informative.”
Early Examples of Knowledge Visualizations
Now that we’ve broadened our views regarding what constitutes an information visualization, allow us to check out some trendy examples. Beneath is a chart from 1644 developed by Michael Florent Van Langren [2]. It is among the earliest graphical representations of what we take into account to be conventional statistical knowledge, depicting estimates of the distinction in longitude between Rome and Toledo.

Let’s take into account a extra concerned instance subsequent—one which instantly highlights Tukey’s quote above.
Beneath is a map of London’s Soho District in 1854 [3]. It was designed by John Snow with a view to decide if there have been any patterns within the cholera outbreak that was debilitating the city on the time:

Trying towards the middle of the map, we will see an exceptionally massive variety of deaths close to the water pump on Broad Avenue. An investigation decided that this pump was contaminated and was a serious reason behind the unfold of the illness.
This instance highlights precisely the precept from John Tukey we famous above: Among the best makes use of of knowledge visualization is to rapidly see insights which can be troublesome to seek out within the knowledge’s preliminary kind.
Precision and Flexibility
Knowledge visualization is a broad and deep subject that may be approached in some ways. That mentioned, there are two rules that you need to have in mind regardless of the particular type of knowledge visualization you interact in: precision and flexibility.
knowledge visualization doesn’t attempt to accomplish ill-defined duties, similar to displaying the essence of or summarizing every little thing essential a couple of knowledge set. Statements like these are subjective and basically not possible to realize.
Somewhat, a great knowledge visualization highlights a selected and well-defined side of the related knowledge in a means that makes it simpler to grasp for the person. It is best to all the time articulate precisely what you need to categorical about your knowledge earlier than you even start designing a visualization.
To internalize this precept, it’s useful to recall what the aim of an information visualization is to start with: to show insights from an information set in a transparent and helpful means. We need to make the information simpler to grasp. Being exact ensures we obtain this aim. A visualization that makes an attempt to do an excessive amount of may find yourself complicated the viewer much more. It’s significantly better to provide a visualization which covers much less knowledge in a clearer means. High quality is extra essential than amount.
Check out the information desk under, which accommodates details about salaries from completely different cities round america.
| Identify | Metropolis | Revenue | Occupation |
|---|---|---|---|
| Sarah Mitchell | Denver, CO | $72,500 | Advertising Supervisor |
| Jamal Rodriguez | Houston, TX | $58,300 | Electrician |
| Priya Desai | Seattle, WA | $91,200 | Software program Engineer |
| Thomas Nguyen | Chicago, IL | $64,800 | Nurse |
Which of the next is the higher visualization selection for the above knowledge?
- A visualization that makes an attempt to simplify the data within the knowledge desk utilizing a bar chart that has names on one axis and salaries on the opposite axis, makes use of coloration to distinguish amongst cities, and makes use of a texture on the bars (dashed strains, diagonal strains, and many others.) to tell apart amongst careers.
- The identical visualization as above, however this time excluding the majors. In different phrases, a bar chart of names and salaries which colours the bars primarily based on location.
It’s tempting to decide on the primary one, however the truth is, it tries to do an excessive amount of. Higher to show restricted, focused info than to confuse your viewers.
Along with being exact, sustaining flexibility can be essential. There isn’t any such factor as an ideal knowledge visualization. There may be all the time room for enchancment, and knowledge visualizations typically turn out to be higher with every revision. After all, in some unspecified time in the future, an information visualization have to be shared with others and serve its function.
This results in a quandary—how a lot revision is sufficient revision? There isn’t any definitive reply to this query. The method of revising a visualization have to be undertaken with care. Asking too many individuals for recommendation will possible lead to a bunch of half-baked, conflicting opinions. However, publishing the primary draft of a visualization—i.e., not revising it in any respect—is prone to result in a subpar end result.
Though there isn’t a good resolution, there are a couple of tips you’ll be able to comply with:
- Establish 2-3 folks to provide you suggestions in your visualization.
- Strive to make sure your checklist of individuals encompasses the next:
- A reviewer who’s proficient in designing knowledge visualizations
- A reviewer who has a powerful understanding of the information that’s getting used to develop the visualization (e.g., a political scientist for election knowledge)
- A reviewer who’s a part of the supposed viewers for the visualization
- Undergo 2-3 rounds of suggestions and revision with this identical checklist of individuals. This may be certain that enhancements to the visualization are steady and logical.
Ultimate Ideas and Trying Ahead
In some ways, knowledge visualization is akin to writing. Even probably the most prolific and gifted authors have editors, and their books undergo in depth revision earlier than being accredited for publishing. Why? For the straightforward motive that good writing is basically depending on the viewers, and punctiliously curated revision ensures the perfect expertise for the eventual readers of a ebook. The identical concept applies to knowledge visualization.
By following these tips, you’ll be able to make sure you develop a sturdy knowledge visualization which is grounded in finest practices, accurately shows the information at hand, and is comprehensible for the supposed viewers.
They’re the important thing to efficient knowledge visualization, and the muse for superior visualization methods that can be mentioned in future articles. Till then.
References
[1] https://commons.wikimedia.org/wiki/File:The_Babylonian_map_of_the_world,_from_Sippar,_Mesopotamia..JPG
[2] The Visible Show of Quantitative Data, Edward Tufte
[3] https://picryl.com/media/snow-cholera-map-1-cbadea

