Introduction to Network Visualization with GEPHI

by Martin Grandjean | 1.07.2013 | 14 comments

A completely new version of this tutorial has been published, with 2 complete and complementary datasets to learn and explore many basic and advanced features of Gephi: To the new tutorial

Gephi workshop at University of Bern (photo Radu Suciu)

Social Network Analysis is a lens, a way of looking at reality. (Claire Lemercier at Swiss Digital Humanities Summer School 2013)

Network Analysis appears to be an interesting tool to give the researcher the ability to see its data from a new angle. Because Gephi is an easy access and powerful network analysis tool, here is a tutorial that should allow everyone to make his first experiments.

I propose below, after a short introduction about the basis of SNA and some examples which shows the potential of this tool, a transcript of tutorial given during a workshop of the first Digital Humanities summer school in Switzerland (June 28. 2013), and kept up to date.

1. Short introduction to Social Network Analysis

A network consists of two components : a list of the actors composing the network, and a list of the relations (the interactions between actors). As part of a mathematical object, actors will then be called vertices (nodes, in Gephi), and relations will be denoted as tiles (edges, in Gephi).

4 types of centrality measures (Claudio Rocchini, Wikimedia)

By left, you can observe a very simple social graph, with both lists explicited. Two attributes are attached to the nodes : a label (his or her “name”) and a numeric attribute (akin to the sex of people here, for example). In the edge list, “Source” and “Target” entries refer to the nodes’ numeric identifiers (Id).

In our example, the “sex” attribute determines the color of the nodes. The size of a node depends on the value of its “degree centrality” (its number of connexions). The centrality measures are essential metrics to analyze the position of an actor in a network. They come in many variations, as shown at right (A = Degree centrality, number of connexions ; B = Closeness centrality, closeness to the entire network ; C = Betweenness centrality, bridges nodes ; D = Eigenvector centrality, connexion to well-connected nodes).

2. Graphs with GEPHI: some examples

3. Downloading GEPHI and the dataset

Gephi Dataset(edges)Dataset (nodes)

Download the application and both CSV files. This tutorial is based on the 0.8.2 Gephi beta version. If you encounter a problem due to a later update, do not hesitate to let me know.

The data consist of a random selection of Twitter users and their “followings” relations. The “Nodes” file contains the identifiers of each nodes, their label, a sex attribute and a random value that will be usefull to play with visualization tools hereafter. The “Edges” file contains a list of identifiers couples showing who follows who.

4. Importing the data into GEPHI

Run the application on your computer and create a “new project” in the start menu. In the Data Laboratory, click on “Import Spreadsheet” to open the import window and import your “nodes” file.

Nodes

Specify that the separation between your data is expressed by a semicolon and do not forget to inform Gephi that the data you import is related to nodes, as demonstrated in this example (left). Then press “next” and fill the import settings form as proposed (right).

Edges

Follow the same procedure as for the nodes, but with the “edges” file downloaded above and by filling the forms in the following manner: specify the semicolon and inform Gephi that this time you import the edges. Fill in the last fields, and uncheck “create missing nodes”, because you’ve already imported them.

5. Visualization!

The action now takes place on the overview panel. The software produces an overview of the graph, spatialized randomly (and completely unreadable).

Nodes’ size

In the “Ranking” panel of the left column (top), select “Nodes” and the “red diamond” (size), then select “Degree” (rolling menu) and enter the minimal and maximal value (we propose 10-150). At that point, it’s possible to click on the “Spline” blue link to edit the shape of the spline (Be aware that linearly double the radius of the nodes is more than double the area because of the power function).

Spatialization

That’s the main part! While it is possible to play (and lose yourself) with various visualization capabilities, I propose a method based on this dataset. Start with Fruchterman Reingold (left column, bottom), and use the same values as in this model (10000 10; 10).

This visualization disposes nodes in a gravitational way (attraction-repulsion, in fact, as magnets). You’re already able to distinguish communities (more densely connected parts of the network). Let the function run until the graph is stabilized. Use the little blue magnifying glass (bottom left of the graph panel) to re-center the zoom.

Then, I propose to use the Force Atlas 2 (another layout algorithm) to disperse groups and give space around larger nodes. Be careful, the parameters you enter significantly alter the final appearance (proposition: Check “prevent overlap” and change “Scaling” to 10). Let the function run until the graph is mostly stabilized.

Nodes’ color

In the Ranking panel, choose the “color” sign to remove these sad shades of gray. As the nodes have attributes, you can color them regarding their “sex” attribute or their “value” (or simply again with their degree centrality).

Please note that if you used the “Spline” for the Nodes’ size, this setting will be used by default here (but can be modified now without interfering with your previous choice).

Nodes’ label

At the bottom right of the graph display, you find a little sign which allows you to develop a new panel. In “Label“, check “Nodes” to add their labels to your nodes and set their font/color/size… If wanted, you can click on the “Configure” link to set the data you want to get displayed.

For privacy reasons, the names are all displayed as “Names” in this dataset. No doubt you understand. Moreover, as the data set has been built (and not collected in a rational manner), the data can not be subject to interpretation.

Final details

Go to “preview” for trimming the final details. Unlike during previous stages, changing settings in this menu is reversible, and do not affect the structure of the graph. In the this screenshot, you will find a suggestion of settings for a good rendering. Be aware that due to its large size, the graph may take a few seconds to update after each change (click on “refresh” to apply the changes).

At the bottom of this preview column, you find an export link. Note that exporting in .png produces figure with a poor resolution. You may want to opt for .svg or .pdf, which have the advantage of being modifiable by your own image/drawing software (I recommend the open source program inkscape for manipulating .svg files).

6. Other features

The visualization is only one step, network analysis often needs other mathematical means to provide the researcher with a satisfactory result. Feel free to explore the “Statistics” menu (right), for example by playing with degree measures, density, path length, modularity.

A network contains internal subdivisions called communities. There are methods that permit to highlight these communities, which depend on the comparison of the densities of edges within a group, and from the group towards the rest of the network. More here!

In the right column of the “overview” page, click on Statistics/Modularity/Run to display the modularity window. Choose a resolution (between 0.1 and 2), click OK and close it.

The next step takes place in the Partition menu situated in the left column. Select “Nodes” and “Modularity Class” (rolling menu). You will be then able to modify the colors attributed to the detected communities by clicking on them.

Do not hesitate to repeat this operation with many “Resolutions” ! If you decide to do so, you must deselect and reselect “Modularity Class” in the left column, and refresh color calculation.

7. Conclusion

Do not forget that what you see of GEPHI is just the tip of the iceberg because the application allows you to install very interesting plugins, has various tutorials and offers a very active forum.

I hope this tutorial has been a way to whet your curiosity to go further in social network analysis, and I am delighted to see your accomplishments!

More Gephi examples:

14 Comments

Claire Bertolini on 01/07/2013 at 20:26

Very interesting post ! To go further in SNA I recommend the MOOC “Social Network Analysis” by Lada Adamic (University of Michigan) on Coursera
Reply
- Martin Grandjean on 01/07/2013 at 22:00
  
  Thank you Claire for the recommendation! I put the link here for any interested: https://www.coursera.org/course/sna
  Reply
gdsaxton@buffalo.edu on 28/12/2013 at 06:32

Hi Martin, I’ve been browsing your site — great work! I love how beautiful your graphs are. I am especially interested in replicating something similar to what is in your Figure 1 above (tweets during a DH conference). I left a comment elsewhere in mangled French (smile) asking how you did that. Did you use a plug-in in Gephi to get the figure that way (with the Number of Tweets written, etc., in the right-hand column), or …? In any case, great work. Thanks for sharing. — Greg
Reply
Donal Phipps on 28/05/2014 at 21:34

Hi, Martin – thank you for this extremely helpful and interesting tutorial – I loved playing with the models in gephi and am looking forward to creating my own models. The explanation of nodes and edges was particularly helpful for me.
Thank you and all the best!
Reply
Ernesto Priego on 15/07/2014 at 10:06

This is a great resource; much clearer and easier to follow than others I had seen. I was sorry not to be in Lausanne for the workshops! Will be sharing with my students. Thanks a lot.
Reply
Lene Søemod on 16/11/2014 at 08:52

Hi Martin. You’ve created a beautiful site with a very helpful introduction. Thank you very much. My problem is how to prepare the data spreadsheet? Can you please give some advice on that?
Reply
- Martin Grandjean on 18/11/2014 at 09:16
  
  Hello, and thank you for your interest!
  Of course, the preparation of the spreadsheet is the most important and time consuming point of a network analysis. I have no general advice for you, because it’s always a very personal approach, closely related to the nature of your data. But try always to think it like a list of pairs of individuals (the list of edges, you can download the example-file at pt.3). If the dataset is very large, it’s hard to built it from A to Z, but usually I simply fill an excel spreadsheet with those pairs of node, and export it in .csv
  Reply
Peter Simon on 16/12/2014 at 20:11

Thank you very much for the marvelous tutorial! I have been avoiding GEPHI because I assumed there was a very steep learning curve and no good information on how to use it.
Reply
- Martin Grandjean on 16/12/2014 at 20:39
  
  I’m really glad it could be useful!
  Reply
Fred Eisele on 28/05/2015 at 17:05

Has anyone used Gephi to visualize a large system function call graph?
Reply
Lucas on 08/10/2015 at 04:16

I would like to ask about a doubt. I tried to do a view of my network of my friends in Facebook using Netvizz, but doesn’t appear the options personal network, that’s why I haven’t got the view. Are there some way I can get this view??
Reply
Diane Gal on 11/12/2015 at 14:48

Thank you. The cartography work you have presented is very well done. I have been trying to use GEPHI with geolayout and map of countries to place network data on a global map but the latitudes and longitudes (centroid and capital cities) end up falling outside of most of the countries in map of countries. Would you be able to suggest any solutions? Or have any suggestions to use a different programme to place the network data on the maps?
Reply
- Martin Grandjean on 14/12/2015 at 08:27
  
  Hello, thank you for your message (and sorry for the late answer)! Please, have a look at the new version of this tutorial, including some GeoLayout experiments: http://www.martingrandjean.ch/gephi-introduction/ It may help you solve your problem.
  If not, could you perhaps describe it more precisely? You seems to have a problem of scale and/or projection, are you working on an editor (as Illustrator/Inkscape) after Gephi?
  Reply
shanaire on 04/04/2016 at 17:12

Thanks for this tutorial, it had indeed made thing clear on how to get started with this software. Thanks again.
Reply

Trackbacks/Pingbacks

Digital Humanities Swiss Summer School on Twitter | Pegasus Data Project - [...] the spirit of the Gephi tutorial given at this conference (the workshop has been published here), here is a…
Introduction to Network Visualization with GEPH... - [...] Actualités, Humanités, Société [...]
A lire et à consulter ailleurs (08/07/2013) | Digital Humanities Toulouse - [...] “Introduction to Network Visualization with GEPHI” par Martin Grandjean (le blog… Martin Grandjean) [...]
DATA VISUALIZATION | Pearltrees - [...] Introduction to Network Visualization with GEPHI [...]
Gephi – curated list of tutorials | Insights @exploreyourdata - [...] Introduction to network visualization with Gephi by Martin GrandJean All the basics explained in one single web page with clear…
Introduction to Network Visualization with GEPH... - [...] Social Network Analysis (SNA) appears to be an interesting tool to give the researcher the ability to see its…
@lukasnet - Introduction to Network Visualization with GEPHI http://t.co/AC1M2viSxX
@pbellot_LSIS - Introduction to Network Visualization with GEPHI http://t.co/G8hCy6KqdG via @GrandjeanMartin
@juanan - Introduction to Network Visualization with GEPHI http://t.co/WV2XZqsdJq via @GrandjeanMartin
2014 | Pearltrees - [...] Introduction to Network Visualization with GEPHI [...]
Frédéric Clavert (@inactinique) - @ProfessMoravec I use gephi a lot. Have you read @GrandjeanMartin tutorial? An excellent and simple introduction: http://t.co/LKTHQp9b2R
Introduction to Network Visualization with GEPHI – Martin Grandjean | Icky Pharmacy - [...] via Introduction to Network Visualization with GEPHI – Martin Grandjean. [...]
Social Network Analysis using Gephi | My exploration in data analytics - […] see more on using Gephi for Data Visualization for Social Network Analysis. Please read through a very good introduction…
Popular Network Mapping | Circling Toward Disseratation - […] Katie Borner uses this. It was also used in the SNA MOOC (Coursera?) This page shows that it can…
An exercise in text mining and distant reading: Helmut Schmidt’s visit to the Bundesbank in 1978 - […] between words (similitude analysis); the website is explaining well how the software works, but Martin Grandjean’s introduction to Gephi…
@yrochat - @Munsterma @luefkens @Gephi try this tuto by @GrandjeanMartin http://t.co/rElGvZL78K
Introduction to Network Visualization with GEPH... - […] Social Network Analysis (SNA) appears to be an interesting tool to give the researcher the ability to see its…
Veille et community management : outils & usages | Pearltrees - […] Introduction to Network Visualization with GEPHI […]
@antounh - Introdução à visualização da rede com o GEPHI http://t.co/kN2SRH8Gqh
@mariona_lmi - Introduction to Network Visualization with GEPHI by @GrandjeanMartin via @Decalapa http://t.co/8kClOtBt8c
@lmiub - Introduction to Network Visualization with GEPHI by @GrandjeanMartin via @Decalapa http://t.co/I95KUmnUPj
Com sabrem si col·laborem més i millor, i quin és el nostre límit? | nexus24upc - […] que defineixen la topologia i propietats de les xarxes. Si tinguéssim temps, aquest llibre, aquesta web o aquest software de ben segur…
@manuelcorujeira - Introduction to Network Visualization with GEPHI http://t.co/1dR0Ypq2kJ vía @GrandjeanMartin
Social Network Analysis | Pearltrees - […] Introduction to Network Visualization with GEPHI […]
DataViz | Pearltrees - […] y Clasificación, en la segunda está el grafo, y en la derecha contexto, estadísticas y filtros. Introduction to Network…
AWPR14 | L’intervention de Martin Grandjean | A l’écoute du web protestant romand - […] Le tutoriel marche bien… sur la durée (ex. de celui réalisé sur Gephi): http://www.martingrandjean.ch/introduction-to-network-visualization-gephi/ […]
DH #2.5: Experiment s Twitter sítí a Gephi | Leni Krsová - […] inventář, užijete si spoustu velice kvalitní datové legrace. Vřele doporučuju si pročíst úvod do vizualizací s Gephi na stránce…
Social Network Analysis | Pearltrees - […] is a great companion for Excel, Numbers or any of the big statistical packages. WikiViz. Introduction to Network Visualization…
Resum del 3r taller | curs Open Data Catalunya Dades / UOC - […] Gephi permet calcular i visualitzar diferents mètriques sobre els nodes, de forma que és possible classificar-los d’acord a diferents…
Socio du numérique | Pearltrees - […] Introduction to Network Visualization with GEPHI. Gephi workshop at University of Bern (photo Radu Suciu) Social Network Analysis is…
@bender_k - Something to explore for art history: 'Introduction to Network Visualization with GEPHI' http://t.co/tmVKaXL9KC via @GrandjeanMartin
Getting Started – First Steps into DH | Ted Howell - […] they will be helpful as I start to figure out what I’m doing. Martin Grandjean’s “Introduction to Network Visualization…
GePhi | Pearltrees - […] by yours truly. Gephi: A video tutorial by Stratidev (in French) A Youtube video in 15 minutes. Introduction to…
@Nexus24UPC - Treballem i aprenem!, Sí se puede!! TAG Introduction to Network Visualization with GEPHI http://t.co/uprAN7GmjQ
Visualisation - Code | Pearltrees - […] Modest Maps is a small, extensible, and free library for designers and developers who want to use interactive maps…
@SolGamsu - @cgsloan Hi! Yes I'm basically self-taught which can be fiddly. Lots avail online though, e.g. http://t.co/eJfKAsI8xa http://t.co/75fTvOSgUz
@khalifahashim - Introduction to Network Visualization with GEPHI http://t.co/jPXScrqzA6 via @GrandjeanMartin
Introduction to Network Visualization: Part 1 (Gephi) | Digital Project Studio - […] out communities in a network with Gephi:Martin Grandjean’s Introduction to Gephi shows that you can employ a combination of…
Introduction to Network Visualization with GEPH... - […] Digital humanities, Data visualization, Network analysis […]
@ProjetCellie - Introduction to Network Visualization with GEPHI http://t.co/NreLiUNPCt #datavisualization #BigData
Clement Levallois (@seinecle) - @FCTweedie @williamjturkel @Adam_Crymble just in case, @GrandjeanMartin released great tutorials for @Gephi: http://t.co/JoidpYqRPd
@rafaelsidi - Visualization with GEPHI #value #1pq2w http://t.co/GvxwXB47jh
@steve_ranford - Nice tutorial for #dhwarwick: Introduction to Network Visualization with GEPHI http://t.co/9v8EDJBA2z via @grandjeanmartin
MIndmapping | Pearltrees - […] Introduction to Network Visualization with GEPHI. Gephi workshop at University of Bern (photo Radu Suciu) Social Network Analysis is…
Visualization with GEPHI - […] Source: Introduction to network visualization with GEPHI […]
@dataspirin - https://t.co/lmVPBI8res
RESEARCH : BIG DATA ANALYSIS – DATASETS (Assignment 6) | wirawanrizkika - […] http://www.martingrandjean.ch/introduction-to-network-visualization-gephi/ […]
@zhijun_yin - Introduction to Network Visualization with GEPHI https://t.co/jg0WYXPwKZ via @grandjeanmartin
Syllabus “The Humanist in the Computer: Digital Humanities and Social Justice” COLT 18.02 (Winter 17) – Dr. Kirstyn Leuner, Postdoctoral Fellow (Dartmouth College) - […] Analysis in the Struggle for Social Justice” (PDF provided); Optional – Martin Grandjean “Introduction to Network Visualization with GEPHI”;…
@nfigay - Introduction to Network Visualization with GEPHI https://t.co/fmFx3zA1Bs
@immaperez - #Data #Mapping - Introduction to Network Visualization with GEPHI https://t.co/Yw3BLZV7FF vía @grandjeanmartin
Visualisation du graphe de mes amis facebook - Martouf le Synthéticien - […] C’est ici qu’il faut que je remercie Martin Grandjean de m’avoir fait connaitre l’outil Gephi et son utilisation… […]
Herramientas InfoVis | Pearltrees - […] No knowledge of HTML, CSS, JavaScript is required. Code is written in R. Datavisualization.ch Selected Tools. Graphviz. RAWGraphs. Dataviz.tools.…
@acerami - Introduction to Network Visualization with GEPHI https://t.co/6wQEL5b1OS via @grandjeanmartin
Introduction to Network Visualization with GEPHI – Martin Grandjean – Epistemic Therapy - […] via Introduction to Network Visualization with GEPHI – Martin Grandjean. […]
Data Visualization with Gephi – Hacking the Humanities 2020 - […] Martin Grandjean’s Introduction to Gephi […]

NEWSLETTER

SOCIAL

twitter 2 icon TWITTER

facebook 2 icon FACEBOOK

youtube icon YOUTUBE

linkedin 2 icon LINKEDIN

instagram icon INSTAGRAM

learn icon SCHOLAR

RECENT POSTS

Introduction to Network Visualization with GEPHI

1. Short introduction to Social Network Analysis

2. Graphs with GEPHI: some examples

3. Downloading GEPHI and the dataset

4. Importing the data into GEPHI

Nodes

Edges

5. Visualization!

Nodes’ size

Spatialization

Nodes’ color

Nodes’ label

Final details

6. Other features

7. Conclusion

More Gephi examples:

Related

14 Comments

Trackbacks/Pingbacks

Comment this postCancel reply

NEWSLETTER

SOCIAL

twitter 2 icon TWITTER

facebook 2 icon FACEBOOK

youtube icon YOUTUBE

linkedin 2 icon LINKEDIN

instagram icon INSTAGRAM

learn icon SCHOLAR

RECENT POSTS