Since quite some time, Google provides an autocomplete feature extended to combinations of words. That’s quite an useful feature because often it “saved” my time from typing full words or combinations of words. What’s interesting is that the autocomplete algorithm provides the terms based on user’s search activities. I was asking myself if we could do more with search queries. This evening, while browsing, I discovered A. Smarty’s post on “How To Visualize and Play with Google Suggest Results”, in which she shortly presents three interesting tools: Web Seer, What do you suggest and Soovle. As I found out there are several other tools like Übersuggest, Quintura, etc. In this post I will focus only on Web Seer, following to review shortly several other similar tools in the next posts.
Web Seer allows users to compare the “matches” between two Google queries, for example “are man” vs. “are women”, “will he” vs. “will she”. To remain in blog’s thematic , I checked tool’s output for “data” vs. “information” and “information” vs. “knowledge”:
The query results for both terms are somehow predictable – “data mining”, “data warehouses”, “data entry” and “data values”, respectively “information architecture”, “information management”, “information security”, “information technology”, “information is beautiful” (see also the book) are quite popular terms in the scientific and non-scientific literature. I would expect the comparison is based on the most popular terms, because the two concepts don’t share many common terms, and even if there are some common terms within the above results (e.g. “data architecture”, “data systems”) they aren’t highly ranked. Arrows’ weight depicts the number of occurrences of the respective terms, which combined with the terms themselves, help to make an idea of the strength and resemblance existing between two concepts.
Climbing the DIKW scale here are the comparisons between information and “knowledge”, respectively “knowledge” and “wisdom”:
As it seems the results are consistent between relations, same combinations being used in two comparisons in which the same term is involved, life in the above diagrams. It’s natural that the results are also commutative, in order words “knowledge” vs. “information” renders same result as “information” vs. “knowledge”.
The association is also reflexive:
And transitive, as “data” vs. “information”, and “information” vs. “knowledge” lead to “data” vs. “knowledge”:
The algebraical operations are not so important, though some consistency of the results is needed between representations. It’s interesting that the comparison is influenced by a space placed at the beginning (e.g. “ data”) or end (“data ”), as can be seen in the following representation of the two:
I would expect other similar signs (e.g. punctuation signs, special characters) influence the comparisons too. Talking about DIKW, the knowledge pyramid, let’s see the comparison between “DIKW” and “data information knowledge wisdom”:
As the two concepts have close semantics, “DIKW” is the acronym for “data information knowledge wisdom”, here’s the comparison between two synonyms: “distribution” vs. “diffusion” (like in distribution/diffusion of knowledge). As can be seen the association is stronger.
Actually the first attempt with the tool was a comparison “concept map” vs. “mind map”:
Which looks slightly different than “concept maps” vs. “mind maps” (so the plural form of words introduces variances):
Considering the few examples run, the tool is quite intuitive and catchy. I would consider its utility as relative, even if the above examples are not representative and the relationships between them are more contextual. Still it’s a good tool for identifying automatically the relations/associations between concepts, to identify associations’ strength and maybe several semantic connotations. It would be interesting to see only the common terms, as many K-maps focus on this aspects, to introduce language and context, and the possibility to compare more than two terms (for example using Venn diagrams) or to show more/less common terms.