Bloomberg Visual Data

Excelente exemplo de visualização de dados interactiva

Excelente exemplo de visualização de dados interactiva

Billionaires of the world ranked and charted

Jan 23, 2013 10:47 am

How wealthy are the richest people in the world? How do they compare to each other, and how does their net worth change over time? Bloomberg just put up an interactive tool to answer such questions, and it’s updated daily with new data.

There are four main views. The one above shows rankings, their estimated net worth, and the change from the previous estimate. Below is a simple ranking of the world’s billionaires. Each floating head is clickable so that you can more information about the individuals, such as a short bio and where there money is from.

It gets more interesting when you click around and explore. For example, there’s a plotting view, and the floating heads transition to their sectors, still sorted by ranking. There’s also a bubble map that you can modify to show the metric you’re interested in.

Finally, a set of filters and a time slider on the bottom ties it all together. Filter by gender, industry, citizenship, age, and whether or not a billionaire’s money was mostly inherited. The slider on the bottom allows you to go back in time to see rankings and net worth change. That part did seem buggy though, as heads seem to disappear or get stuck if you shift too much.

Overall: There’s a lot of interesting things to look at and explore, and it works well as a tool. The next steps would probably be to provide pointers and annotation since you have to do most of the searching yourself in this form (but I don’t think that’s what they were going for).

Tags: , ,

Relação espúria

Um claro exemplo de uma relação estatísticamente significativa mas sem qq significado prático

Um claro exemplo de uma relação estatisticamente significativa mas sem qq significado prático

Tags: , ,

Global temperature rises over past century

Boa visualização do aumento de temperaturas médias mundial

Boa visualização do aumento de temperaturas médias mundial

New Scientist mapped global temperature change based on a NASA GISTEMP analysis.

The graphs and maps all show changes relative to average temperatures for the three decades from 1951 to 1980, the earliest period for which there was sufficiently good coverage for comparison. This gives a consistent view of climate change across the globe. To put these numbers in context, the NASA team estimates that the global average temperature for the 1951-1980 baseline period was about 14 °C.

The more red an area the greater the increase was estimated to be, relative to estimates for 1951 to 1980 (especially noticeable in the Northern Hemisphere).

The most interesting part is when you compare all the way back to to the 19th century when it was much cooler. You can also click on locations for a time series of five-year averages.

Tags: , ,

Creating Animations and Transitions With D3

Construção de visualizações de dados em java

Construção de visualizações de dados em java

In interactive visualisation, there is the word reactive. Well, maybe not literally, but close enough.

The fact is that reactivity, or the propension of a visualisation to respond to user actions, can really help engage the user in a visualisation, and help them understand its results. Both of which are usually good things. How can this reactivity be achieved? Through animations.

So I’ll go ahead and state that animation, if done right, can make any interactive data visualization better.

How is that?

  • When coupled with interaction, it’s a very useful way to give feedback to the user. What has changed since their last command? If what’s on screen animates from one state to another, it’s obvious, it stands out and it makes sense. Or, when showing any form of real-time data, animation is pretty much required.
  • Animation can bring focus on the important things as a chart loads. Our vision is very sensitive to movement, so using these introduction transitions sensibly helps a lot to ease the effort required to get the right information off a chart.

Tags: , ,

Women as Academic Authors, 1665-2010

Exemplo de representação gráfica interativa com bolhas num eixo

Exemplo de representação gráfica interativa com bolhas num eixo

The Chronicle of Higher Education has a look at the percentage of academic papers published by women, over the past five centuries.

The articles and authors described in this data were drawn from the corpus of JSTOR, a digital archive of scholarly papers, by researchers at the Eigenfactor Project at the University of Washington. About two million articles, representing 1765 fields and sub-fields, were examined, spanning a period from 1665 to 2011. The data are presented here for three time periods, the latest one ending in 2010, and a view that combines all periods.

Percentage of female authors is on the horizontal, and each bubble is a subfield sized by total number of authors. The graphic starts with publishing for all years, but be sure to click on the tabs for each time span to see changes.

The data is based on the archive of about two million articles from JSTOR, and a hierarchical map equation method is used to determine subfields.

The gender classification they used for names seems like it could be nifty for some applications. Gender is inferred by comparing names against the ones kept by the U.S. Social Security Administration, which includes gender. If a name was used for female at least 95 percent of the time, it was classified as a female name, and the same was done with male. Anything ambiguous was not included in the study

Tags: , , ,

Introductory Statistics

Muito boas aulas e slides sobre testes não paramétricos e SPSS

Muito boas aulas e slides sobre testes não paramétricos e SPSS

Welcome to Limbo, where the lustful, gluttonous and wrathful wander in endless torment. Here you can uncover the searing agony of SPSS, the stomach churning fear of central tendency and the rancid bile of z-scores. Good luck, you’ll need it.

Tags: , , , , , , , ,

PANDA – Practical Analysis of Nutritional Data

Bom livro sobre análise de dados e testes estatísticos

Bom livro sobre análise de dados e testes estatísticos


Chapter 1 –
USING NUTRITIONAL DATA
Chapter 2 –
DATA CLEANING

Chapter 3 –
ONE-WAY ANALYSIS

Chapter 4 –
TWO-WAY ANALYSIS

Chapter 5 –
MULTI-WAY ANALYSIS

Chapter 6 –
SUBMODULES

Chapter 7 – ASSESS YOUR UNDERSTANDING
Chapter 8
KENYA REAL WORLD ANALYSIS

Building and presenting a situation analysis
Child Feeding Practices

Tags: , , , , ,

Determining the Number of Components and Factors Using Parallel Analysis and Velicer’s MAP Test

Softwere para análise do nº de componentes em ACP e AF

Softwere para análise do nº de componentes em ACP e AF

Popular statistical software packages do not have the proper procedures for determining the number of components or factors in correlation matrices. Parallel analysis and Velicer’s minimum average partial (MAP) test are validated procedures that are widely recommended by statisticians. This paper described brief and efficient programs for conducting parallel analyses and the MAP test using SPSS, SAS, and MATLAB.

Métodos para facilitar a interpretação da AFE

Métodos para facilitar a interpretação da AFE

Scale development using popular statistical software packages often produces results that are baffling or misunderstood by many users, which can lead to inappropriate substantive interpretations and item selection decisions. High internal consistencies do not indicate unidimensionality; item-total correlations are inflated because each item is correlated with its own error as well as the common variance among items; and the default number-of-eigenvalues-greater-than-one rule, followed by principal components analysis and varimax rotation, produces inflated loadings and the possible appearance of numerous uncorrelated factors for items that measure the same construct (Gorsuch, 1997a, 1997b). Concerned investigators may then neglect the higher order general factor in their data as they use misleading statistical output to trim items and fashion unidimensional scales.

These problems can be circumvented in exploratory factor analysis by using more appropriate factor analytic procedures and by using extension analysis as the basis for adding items to scales. Extension analysis provides correlations between nonfactored items and the factors that exist in a set of core items. The extension item correlations are then used to decide which factor, if any, a prospective item belongs to. The decisions are unbiased because factors are defined without being influenced by the extension items. One can also examine correlations between extension items and any higher order factor(s) in the core items. The end result is a comprehensive, undisturbed, and informative picture of the correlational structure that exists in a set of core items and of the potential contribution and location of additional items to the structure.

Tags: , , , , ,

Correspondence Analysis

Bom texto sobre análise de correspondências

Bom texto sobre análise de correspondências

1. Introduction
2. Overview of the Method

3. The Analysis I

4. The Analysis II

5. What you Find in the Output and How to Use it

6. Graphical Representations and Interpretation

7. Extensions and Limitations

8. Bibliography

Tags: ,

Pitfalls of Data Analysis

Como evitar erros e viéses na análise de dados

Como evitar erros e viéses na análise de dados

There seems to be a pervasive notion that “you can prove anything with statistics.” This is only true if you use them improperly. In this workshop we’ll discuss things that people often overlook in their data analysis, and ways people sometimes “bend the rules” of statistics to support their viewpoint. We will also discuss ways you can make sure your own statistics are clear and accurate. I will include examples from medicine, education, and industry.

Tags: , ,