BEAKER The data scientist’s laboratory

Um ambiente de programação q aceita várias linguagens

Um ambiente de programação q aceita várias linguagens

Beaker is a code notebook that allows you to analyze, visualize, and document data using multiple programming languages including Python, R, Groovy, Julia, and Node. Beaker’s plugin-based polyglot architecture enables you to seamlessly switch between languages and add support for new languages.

Tags: , ,

Resources for Getting Started with R

Recursos para começar a usar o R

Recursos para começar a usar o R

Resources for Getting Started with R

June 4, 2012  |  Software

R, the open source statistical software environment, is powerful but can be a challenge to approach for beginners. For me, the best way to learn R, especially on the visualization side of things, is to dive right in. Grab some data and make some charts, or better yet, find a graph you like and try to replicate it.

R core functionality and the many available packages let you do a lot without having to know what’s going on underneath. I use this approach in Visualize This and the tutorials around here. I like the satisfaction of immediate results. Then I learn the nitty gritty later.

That said, it doesn’t hurt to familiarize yourself with the environment. Also, visualization is a small part of what you can do with R, so it can help to know what else you can do analysis-wise.

Tags: , ,

An intro to R for new programmers

Introdução às estruturas de dados no R

Introdução às estruturas de dados no R

Following the lead of JavaScript for Cats by Maxwell Ogden, Scott Chamberlain and Carson Sievert wrote R for Cats. It’s a playful introduction to R intended for those who have little to no programming experience.

The bulk of it so far is a primer on data structures, and there’s a little bit on functions and some dos and don’ts. It’s stuff you should know before you get into more advanced tutorials.

Mainly though: ooo look, kitty.

Once you’re done with that (It only takes about 30 minutes.), there are lots of other resources for getting started with R.

Tags: , ,

Create a barebones R package from scratch

Criar pacotes para o R é muito fácil

Criar pacotes para o R é muito fácil

While we’re on an R kick, Hilary Parker described how to create an R package from scratch, not just to share code with others but to save yourself some time on future projects. It’s not as hard as it seems.

This tutorial is not about making a beautiful, perfect R package. This tutorial is about creating a bare-minimum R package so that you don’t have to keep thinking to yourself, “I really should just make an R package with these functions so I don’t have to keep copy/pasting them like a goddamn luddite.” Seriously, it doesn’t have to be about sharing your code (although that is an added benefit!). It is about saving yourself time. (n.b. this is my attitude about all reproducibility.)

I need to do this. I’ve been meaning to wrap everything up for a while now, but it seemed like such a chore. Sometimes I’d even go back to my own tutorials for some copy and paste action. Now I know better. And that’s half the battle.

Tags: , ,

Using R in Nonparametric Statistical Analysis

Blog com vários tutoriais para usar estatísticas não paramétricas simples em R

Blog com vários tutoriais para usar estatísticas não paramétricas simples em R

Tags: , ,

Why use R? Five reasons

Bom blogue, as principais razões para usar R

Bom blogue, as principais razões para usar R

Why use R? Five reasons.

In this post I will go through 5 reasons: zero cost, crazy popularity, awesome power, dazzling flexibility, and mind-blowing support. I believe R is the best statistical programming language to learn. As a blogger who has contributed over 150 posts in Stata and over 100 in R I have extensive experience with both a proprietary statistical programming language as well as the open source alternative.  In my graduate career I have also had the opportunity to experiment with the proprietary software SPSS, SAS, Mathematica, as well as MPlus.

Tags: , , ,

Read Histograms and Use Them in R

Bom tutorial para construir histogramas no R

Bom tutorial para construir histogramas no R

Tutorials,

How to Read Histograms and Use Them in R

By Nathan Yau
The chart type often goes overlooked because people don’t understand them. Maybe this will help.

The histogram is one of my favorite chart types, and for analysis purposes, I probably use them the most. Devised by Karl Pearson (the father of mathematical statistics) in the late 1800s, it’s simple geometrically, robust, and allows you to see the distribution of a dataset.

If you don’t understand what’s driving the chart though, it can be confusing, which is probably why you don’t see it often in general publications.

Tags: , , , ,

Data Intelligence and Analytics Resources

Excelentes textos sobre ciencia dos dados e big data

Excelentes textos sobre ciencia dos dados e big data

3. Big Data

4. Visualization

5. Best and Worst of Data Science

6. New Analytics Start-up Ideas

7. Rants about Healthcare, Education, etc.

8. Career Stuff, Training, Salary Surveys

9. Miscellaneous

10. DSC Webinar Series – with video access

Tags: , , ,

17 short tutorials all data scientists should read

Excelentes textos fundamentais para cientistas dos dados

Excelentes textos fundamentais para cientistas dos dados

Here’s the list:

Related linkThe Data Science Toolkit

Tags: , , , , ,

Little Book of R for Time Series!

Um excelente tutorial sobre modelos de previsão simples no R

Um excelente tutorial sobre modelos de previsão simples no R

Tags: ,