Armando B. Mendes

Bloomberg Visual Data

Posted by Armando Brito Mendes | Filed under estatística, visualização

Excelente exemplo de visualização de dados interactiva

Billionaires of the world ranked and charted

Jan 23, 2013 10:47 am

How wealthy are the richest people in the world? How do they compare to each other, and how does their net worth change over time? Bloomberg just put up an interactive tool to answer such questions, and it’s updated daily with new data.

There are four main views. The one above shows rankings, their estimated net worth, and the change from the previous estimate. Below is a simple ranking of the world’s billionaires. Each floating head is clickable so that you can more information about the individuals, such as a short bio and where there money is from.

It gets more interesting when you click around and explore. For example, there’s a plotting view, and the floating heads transition to their sectors, still sorted by ranking. There’s also a bubble map that you can modify to show the metric you’re interested in.

Finally, a set of filters and a time slider on the bottom ties it all together. Filter by gender, industry, citizenship, age, and whether or not a billionaire’s money was mostly inherited. The slider on the bottom allows you to go back in time to see rankings and net worth change. That part did seem buggy though, as heads seem to disappear or get stuck if you shift too much.

Overall: There’s a lot of interesting things to look at and explore, and it works well as a tool. The next steps would probably be to provide pointers and annotation since you have to do most of the searching yourself in this form (but I don’t think that’s what they were going for).

Tags: análise de dados, data mining, Estat Descritiva

Read more | Comments off | January 28th, 2013

Relação espúria

Posted by Armando Brito Mendes | Filed under estatística, visualização

Um claro exemplo de uma relação estatísticamente significativa mas sem qq significado prático

Um claro exemplo de uma relação estatisticamente significativa mas sem qq significado prático

Tags: análise de dados, data mining, Estat Descritiva

Read more | Comments off | January 28th, 2013

Global temperature rises over past century

Posted by Armando Brito Mendes | Filed under estatística, SAD - DSS, visualização

Boa visualização do aumento de temperaturas médias mundial

New Scientist mapped global temperature change based on a NASA GISTEMP analysis.

The graphs and maps all show changes relative to average temperatures for the three decades from 1951 to 1980, the earliest period for which there was sufficiently good coverage for comparison. This gives a consistent view of climate change across the globe. To put these numbers in context, the NASA team estimates that the global average temperature for the 1951-1980 baseline period was about 14 °C.

The more red an area the greater the increase was estimated to be, relative to estimates for 1951 to 1980 (especially noticeable in the Northern Hemisphere).

The most interesting part is when you compare all the way back to to the 19th century when it was much cooler. You can also click on locations for a time series of five-year averages.

Tags: análise de dados, data mining, Estat Descritiva

Read more | Comments off | January 18th, 2013

Creating Animations and Transitions With D3

Posted by Armando Brito Mendes | Filed under materiais para profissionais, SAD - DSS, visualização

Construção de visualizações de dados em java

In interactive visualisation, there is the word reactive. Well, maybe not literally, but close enough.

The fact is that reactivity, or the propension of a visualisation to respond to user actions, can really help engage the user in a visualisation, and help them understand its results. Both of which are usually good things. How can this reactivity be achieved? Through animations.

So I’ll go ahead and state that animation, if done right, can make any interactive data visualization better.

How is that?

When coupled with interaction, it’s a very useful way to give feedback to the user. What has changed since their last command? If what’s on screen animates from one state to another, it’s obvious, it stands out and it makes sense. Or, when showing any form of real-time data, animation is pretty much required.
Animation can bring focus on the important things as a chart loads. Our vision is very sensitive to movement, so using these introduction transitions sensibly helps a lot to ease the effort required to get the right information off a chart.

Tags: análise de dados, data mining, desnvolvimento de software

Read more | Comments off | January 18th, 2013

Women as Academic Authors, 1665-2010

Posted by Armando Brito Mendes | Filed under estatística, visualização

Exemplo de representação gráfica interativa com bolhas num eixo

The Chronicle of Higher Education has a look at the percentage of academic papers published by women, over the past five centuries.

The articles and authors described in this data were drawn from the corpus of JSTOR, a digital archive of scholarly papers, by researchers at the Eigenfactor Project at the University of Washington. About two million articles, representing 1765 fields and sub-fields, were examined, spanning a period from 1665 to 2011. The data are presented here for three time periods, the latest one ending in 2010, and a view that combines all periods.

Percentage of female authors is on the horizontal, and each bubble is a subfield sized by total number of authors. The graphic starts with publishing for all years, but be sure to click on the tabs for each time span to see changes.

The data is based on the archive of about two million articles from JSTOR, and a hierarchical map equation method is used to determine subfields.

The gender classification they used for names seems like it could be nifty for some applications. Gender is inferred by comparing names against the ones kept by the U.S. Social Security Administration, which includes gender. If a name was used for female at least 95 percent of the time, it was classified as a female name, and the same was done with male. Anything ambiguous was not included in the study

Tags: análise de dados, captura de conhecimento, data mining, Estat Descritiva

Read more | Comments off | January 9th, 2013

Introductory Statistics

Posted by Armando Brito Mendes | Filed under estatística, materiais ensino, videos, visualização

Muito boas aulas e slides sobre testes não paramétricos e SPSS

Welcome to Limbo, where the lustful, gluttonous and wrathful wander in endless torment. Here you can uncover the searing agony of SPSS, the stomach churning fear of central tendency and the rancid bile of z-scores. Good luck, you’ll need it.

Tags: análise de dados, captura de conhecimento, data mining, decisão médica, Estat Descritiva, IBM SPSS Statistics, inferência, inquéritos, software estatístico

Read more | Comments off | January 9th, 2013

PANDA – Practical Analysis of Nutritional Data

Posted by Armando Brito Mendes | Filed under estatística, materiais ensino, software, visualização

Bom livro sobre análise de dados e testes estatísticos

Chapter 1 – USING NUTRITIONAL DATA
Chapter 2 – DATA CLEANING
Chapter 3 – ONE-WAY ANALYSIS
Chapter 4 – TWO-WAY ANALYSIS
Chapter 5 – MULTI-WAY ANALYSIS
Chapter 6 – SUBMODULES
Chapter 7 – ASSESS YOUR UNDERSTANDING
Chapter 8 – KENYA REAL WORLD ANALYSIS

Building and presenting a situation analysis
Child Feeding Practices

Tags: análise de dados, decisão médica, Estat Descritiva, IBM SPSS Statistics, inferência, software estatístico

Read more | Comments off | January 9th, 2013

Determining the Number of Components and Factors Using Parallel Analysis and Velicer’s MAP Test

Posted by Armando Brito Mendes | Filed under estatística, software, visualização

Softwere para análise do nº de componentes em ACP e AF

Popular statistical software packages do not have the proper procedures for determining the number of components or factors in correlation matrices. Parallel analysis and Velicer’s minimum average partial (MAP) test are validated procedures that are widely recommended by statisticians. This paper described brief and efficient programs for conducting parallel analyses and the MAP test using SPSS, SAS, and MATLAB.

Métodos para facilitar a interpretação da AFE

Scale development using popular statistical software packages often produces results that are baffling or misunderstood by many users, which can lead to inappropriate substantive interpretations and item selection decisions. High internal consistencies do not indicate unidimensionality; item-total correlations are inflated because each item is correlated with its own error as well as the common variance among items; and the default number-of-eigenvalues-greater-than-one rule, followed by principal components analysis and varimax rotation, produces inflated loadings and the possible appearance of numerous uncorrelated factors for items that measure the same construct (Gorsuch, 1997a, 1997b). Concerned investigators may then neglect the higher order general factor in their data as they use misleading statistical output to trim items and fashion unidimensional scales.

These problems can be circumvented in exploratory factor analysis by using more appropriate factor analytic procedures and by using extension analysis as the basis for adding items to scales. Extension analysis provides correlations between nonfactored items and the factors that exist in a set of core items. The extension item correlations are then used to decide which factor, if any, a prospective item belongs to. The decisions are unbiased because factors are defined without being influenced by the extension items. One can also examine correlations between extension items and any higher order factor(s) in the core items. The end result is a comprehensive, undisturbed, and informative picture of the correlational structure that exists in a set of core items and of the potential contribution and location of additional items to the structure.

Tags: análise de dados, captura de conhecimento, decisão médica, desnvolvimento de software, IBM SPSS Statistics, software estatístico

Read more | Comments off | January 8th, 2013

Correspondence Analysis

Posted by Armando Brito Mendes | Filed under estatística

Bom texto sobre análise de correspondências

1. Introduction
2. Overview of the Method

3. The Analysis I

4. The Analysis II

5. What you Find in the Output and How to Use it

6. Graphical Representations and Interpretation

7. Extensions and Limitations

8. Bibliography

Tags: análise de dados, inferência

Read more | Comments off | January 7th, 2013

Pitfalls of Data Analysis

Posted by Armando Brito Mendes | Filed under estatística

Como evitar erros e viéses na análise de dados

There seems to be a pervasive notion that “you can prove anything with statistics.” This is only true if you use them improperly. In this workshop we’ll discuss things that people often overlook in their data analysis, and ways people sometimes “bend the rules” of statistics to support their viewpoint. We will also discuss ways you can make sure your own statistics are clear and accurate. I will include examples from medicine, education, and industry.

Tags: análise de dados, Estat Descritiva, inferência

Read more | Comments off | January 7th, 2013

« Older Entries

Newer Entries »

Bloomberg Visual Data

Billionaires of the world ranked and charted

Relação espúria

Global temperature rises over past century

Creating Animations and Transitions With D3

Women as Academic Authors, 1665-2010

Introductory Statistics

PANDA – Practical Analysis of Nutritional Data

Determining the Number of Components and Factors Using Parallel Analysis and Velicer’s MAP Test

Correspondence Analysis

Pitfalls of Data Analysis

Categorias de Posts

Palavras chave mais usadas

Arquivo

Recent Posts

Recent Comments

About