Data visualization for research

Data visualization is definitely my favorite part of research. I am not a designer and have not received formal training in data visualization, but it’s something I deeply enjoy and it is one of the tasks that I do when I want to feel productive but I don’t feel like writing or responding to emails.

The images I’ve collected in this post are some of my favorites from different papers. Some are meant to explain a complicated finding. Others try to capture a messy idea or illustrate the full analytical pipeline.

The first figure is a classic in my research, which is using sequence analysis to illustrate longitudinal trajectories of categorical variables. This one specifically is from a recent paper in Computers in Human Behaviour (López-Pernas and Saqr 2024). I use the TraMineR package for this. For the transitions, I use tna, our own package for Transition Network Analysis, based on qgraph. I love this figure not only because I think it is very aesthetically pleasing but because it allows you to see the whole picture immediately. I have used many of such figures in my research using the VaSStra method, to which I devoted another post.

Another type of figure that my colleagues really like is the alluvial plot. I have shared a tutorial on how to create it with ggalluvial in a previous post. The example below is meant to illustrate transitions between (categorical) answers to a pre-test and post-test about AI in social media (Vartiainen et al. 2024).

One of my latest favorites is this image from our recent study on LLM-generated responses to the MSLQ questionnaire (Vogelsmeier et al. 2025). For this one, I used my go-to package ggplot2. The code will be available once the article is published. The figure represents the factor loadings from Exploratory Factor Anaysis, with columns representing extracted factors (f1 – f7 on the x-axis) and rows grouped by theoretical constructs of MSLQ.

Lastly, I often create figures that summarize the analytical pipeline in my papers, so readers can understand the methods used easily. I don’t use any fancy tools, just Google Slides and a lot of patience.

For example, the following figure illustrates three levels of sequence analysis clustering, following VaSSTra, part of one of my early papers in learning analytics (Saqr et al. 2023).

For my recent book, I also created many of these figures. For example, for the chapter on eXplainable AI using LLMs (López-Pernas et al. 2025).

Shoutout to the Free Pople Icons for Diverse & Inclusive Designs for making their icons available freely. I use them all the time.

References

López-Pernas, Sonsoles, and Mohammed Saqr. 2024. “How the Dynamics of Engagement Explain the Momentum of Achievement and the Inertia of Disengagement: A Complex Systems Theory Approach.” Comput. Human Behav. 153 (April): 108126. https://doi.org/10.1016/j.chb.2023.108126.

López-Pernas, Sonsoles, Yige Song, Eduardo Oliveira, and Mohammed Saqr. 2025. “LLMs for Explainable Artificial Intelligence: Automating Natural Language Explanations of Predictive Analytics Models.” In Advanced Learning Analytics Methods: AI, Precision and Complexity, edited by Mohammed Saqr and Sonsoles López-Pernas. Cham: Springer Nature Switzerland. https://lamethods.github.io/book2/chapters/ch11-llmsxai/ch11-llmsxai.html.

Saqr, Mohammed, Sonsoles López-Pernas, Jelena Jovanović, and Dragan Gašević. 2023. “Intense, Turbulent, or Wallowing in the Mire: A Longitudinal Study of Cross-Course Online Tactics, Strategies, and Trajectories.” Internet High 57: 100902. https://doi.org/10.1016/j.iheduc.2022.100902.

Vartiainen, Henriikka, Juho Kahila, Matti Tedre, Sonsoles López-Pernas, and Nicolas Pope. 2024. “Enhancing Children’s Understanding of Algorithmic Biases in and with Text-to-Image Generative AI.” New Media Soc., May. https://doi.org/10.1177/14614448241252820.

Vogelsmeier, Leonie V D E, Eduardo Oliveira, Kamila Misiejuk, Sonsoles López-Pernas, and Mohammed Saqr. 2025. “Delving into the Psychology of Machines: Exploring the Structure of Self-Regulated Learning via LLM-Generated Survey Responses.” arXiv [Cs.AI], June. http://arxiv.org/abs/2506.13384.

Citation

BibTeX citation:

@misc{lópez-pernas2025,
  author = {López-Pernas, Sonsoles},
  title = {Data Visualization for Research},
  date = {2025-07-30},
  url = {https://sonsoleslp.github.io/posts/visualization/},
  langid = {en}
}

For attribution, please cite this work as:

López-Pernas, Sonsoles. 2025. “Data Visualization for Research.” https://sonsoleslp.github.io/posts/visualization/.