S-PLUS & R Class Links
Posted by Armando Brito Mendes | Filed under estatística, materiais ensino, materiais para profissionais, software
montes de materiais para R e S-PLUS.
S-PLUS & R Class Links
Instructor: Richard Herrington
Why Do We Care To Use the “S” Language? Does anyone care besides us? The Association for Computing Machinery (ACM) cares
S-Plus
- Introduction to S language (S-Plus, R)
- S-Plus Windows Notes (MASS – Brian Ripley)
- S-Plus 6 Users Guide
- S-Plus 6 Users Guide to “Introduction to the Practice of Statistics”
- Datasets for “Introduction to the Practice of Statistics”
- S-Plus Tutorial
- S-Plus Website
- S-Plus Official Documentation
S-PLUS Student Edition Download (Free)
- Student Edition 6.2 – This version of S-Plus has a 20,000 cell or 1,000 row limitation; is only for educational use; is good for only one year; and is a rather large download (100+ meg).
S-PLUS Free Experimental Libraries and User Contributed Libraries
- Research Libraries – Includes: S+CorrelatedData (mixed effects generalized linear models), S+Best (B-Spline methods), S+Resample (bootstrap library), S+Bayes (bayesian analysis), S+FDA (functional data analysis).
- User Contributed Libraries
Tinn-R Script Editor
- SourceForge Download Site – Download Tinn-R Setup Files; Tinn-R can serve as a script editor and “pager” for R Console. See Tinn-R Convenient Script Editor for R on Win32 Platform
R
Download Site for the Current Windows Install Binary and R Packages
- Main CRAN Website
- CRAN Windows Binary – Installer for Win32 (also available for MacOS X and Linux
- CRAN R Package Descriptions – Pdf manuals available for packages
- R Packages Organized by Topic – Organized by discipline and methodology
Web Interfaces to R Web Servers and Example R Scripts
- R Web Interfaces – Web/browser based interfaces to R script processing on a server
- Example R Scripts – Some of these scripts run on a server and results are communicated thru a web browser
- RSS Rweb Server – Link to http:/rss.acs.unt.edu R server
R, R(D)COM and Excel
- Introduction to R – ppt
- Using R with Excel – A (D)COM Server for the Windows Platform – Benchmarks article (needs updating badly – Rich)
- R(D)COM Server Homepage – Thomas Baier
- R(D)COM Listserve Archive
- Download for single install file – R-2.4.1, R(D)COM and RExcel only
- Download for single install file – R-2.4.1 – use this file for a comprehensive install of selectable components: R, Rcmdr, R(D)COM, RExcel, rcom, gobbi, Rgobbi, and Tinn-R. R Version 2.5.0 combined install found here.
- Installing The R(D)COM server – Help Pages
- Using The R(D)COM server – Help Pages
- Using R Within Excel – Help Pages
- Duncan Temple Lang’s R (D)COM Homepage
- Duncan Temple Lang’s R (D)COM Notes
- Duncan Temple Lang’s R (D)COM Client examples– directory listing
- Duncan Temple Lang’s R (D)COM Server – examples listing
- General Computing Considerations
Tags: Excel, R-software, software estatístico
Scholarpedia
Posted by Armando Brito Mendes | Filed under estatística, matemática, materiais para profissionais, refs bibliográficas
Uma alternativa à weekepedia com suposta melhor qualidade.
Welcome to Scholarpedia
the peer-reviewed open-access encyclopedia,
where knowledge is curated by communities of experts
Disseminate: share your expertise with a global audience
Pioneer: write the first persistent online review in your area of specialization
Steward: supervise the development of articles in your field
Collaborate: work with expert scientists and scholars from around the world
Learn: gain experience with scholarly writing and editing
Publish: transform your writing into a peer-reviewed article
erts
Tags: data mining, motores de busca
Spreadsheet Addiction
Posted by Armando Brito Mendes | Filed under estatística, materiais para profissionais
Um bom e muito completo relato dos defeitos do MS Excel para análise de dados.
Some people will think that the “addiction” in the title is over the top, or at least used metaphorically. It is used literally, and is not an exaggeration.
Addiction is the persistent use of a substance where that use is detrimental to the user. It is not the substance that is the problem — more limited use may be beneficial. It is the extent and circumstances of the use that determine if the behavior is addictive or not.
Spreadsheets are a wonderful invention. They are an excellent tool for what they are good at. The problem is that they are often stretched far beyond their home territory. Dangerous abuse of spreadsheets is only too common.
I know there are many spreadsheets in financial companies that take all night to compute. These are complicated and commonly fail. When such spreadsheets are replaced by code more suited to the task, it is not unusual for the computation time to be cut to a few minutes and the process much easier to understand.
A 2012 example of spreadsheet addiction.
The technology acceptance model holds that there are two main factors that determine the uptake of a technology: the perceived usefulness and the perceived ease-of-use. Perception need not correspond to reality.
The perception of the ease-of-use of spreadsheets is to some extent an illusion. It is dead easy to get an answer from a spreadsheet, however, it is not necessarily easy to get the right answer. Thus the distorted view.
The difficulty of using alternatives to spreadsheets is overestimated by many people. Safety features can give the appearance of difficulty when in fact these are an aid.
The hard way looks easy, the easy way looks hard.
The remainder of this page is divided into the sections:
Spreadsheet Computation
The Treatment Center (Alternatives)
If You Must Persist
Specific Problems with Excel
Additional Links
Tags: análise de dados, Excel, programação em folha de cálculo
income rise hints at recovery
Posted by Armando Brito Mendes | Filed under estatística, materiais para profissionais, visualização
Mais uma excelente representação gráfica interativa de um jornal on-line.
By Ted Mellnik and Lazaro Gamio, Published: Sept. 18, 2014
Although incomes are still lower than five years ago, most large metropolitan areas showed at least a tiny gain last year. The patterns suggest that while many regional economies may have turned a corner on the recession, incomes are making a slow advance toward 2009 levels. These charts show data for median household incomes released on Thursday by the Census Bureau in its American Community Survey. Related story.
Tags: belo
PlotDevice: Draw with Python
Posted by Armando Brito Mendes | Filed under estatística, materiais para profissionais, software, visualização
Uma biblioteca de funções em Pyton para construir visualizações de dados.
You’ve been able to visualize data with Python for a while, but Mac application PlotDevice from Christian Swinehart couples code and graphics more tightly. Write code on the right. Watch graphics change on the right.
The application gives you everything you need to start writing programs that draw to a virtual canvas. It features a text editor with syntax highlighting and tab completion plus a zoomable graphics viewer and a variety of export options.
PlotDevice’s simple but comprehensive set of graphics commands will be familiar to users of similar graphics tools like NodeBox or Processing. And if you’re new to programming, you’ll find there’s nothing better than being able to see the results of your code as you learn to think like a computer.
Looks promising. Although when I downloaded it and tried to run it, nothing happened. I’m guessing there’s still compatibility issues to iron out at version 0.9.4. Hopefully that clears up soon. [via Waxy]
Tags: big data, data mining, desnvolvimento de software, Estat Descritiva
What is probabilistic programming?
Posted by Armando Brito Mendes | Filed under materiais para profissionais, software
Uma nova forma de trabalhar com modelos Bayesianos!
«
In this post, I introduce the emerging area of probabilistic programming, showing how probabilistic programs will hopefully make it easier to perform Bayesian-style machine learning, among other applications. Probabilistic programming is an exciting, and growing, area of research, with fantastic people in both AI/ML and PL working together and making big strides. PL methods — including formal semantics, optimization techniques, and forms of static analysis — have proven very useful in advancing this area forward.
Tags: bioinformatica, data mining, decisão médica, definição, desnvolvimento de software, qualidade
Why Content Marketing Fails
Posted by Armando Brito Mendes | Filed under materiais para profissionais, planeamento, software
Why Content Marketing Fails Presentation Transcript
- Rand Fishkin, Wizard of Moz | @randfish | rand@moz.com Why Content Marketing Fails
- Download this Slide Deck bit.ly/mozcontentfail
- Failure Sucks
- Most Content Efforts Will Fail Spike of hope Flatline of nope
- You’ll Invest with the Best of Intentions
- You’ll Launch with Excitement
- And it Will Suck
- And You Won’t Know Why
- Why Your Content Marketing Will Fail 5 Reasons
- You Believed the Biggest Myth Content Marketing Ever Told the World #1
- We Imagine Content Marketing Works Like This… Let’s go see what’s on the Internet today, shall we?
- I wonder what this post Kieran tweeted is all about.
- That was an interesting. I should sign up for this free ebook download while I’m here!
- And I’ll just fill out their contact form with my information….
- Oh! Better not forget to follow all their social profiles while I’m at it.
- You might laugh, but a lot of companies invest in content marketing on the assumption that this is how it works! Me make content. Humans click. Them buy. Me get money.
- In Reality, It Works Like This: Let’s see what’s on Hacker News… Meh.
- How about Facebook? Oh, right. Baby pictures.
- Let’s try Twitter. Meh.
- Maybe Google+? Actually… This looks kinda interesting.
- After seeing 400+ links I could have clicked, I finally chose this one. It’s a good article, and this example, naturally, resonated
- I wonder why I like this blog so much? About a week later, I caught a link to Beardbrand’s blog.
- No magic trick to grow a thicker beard?! Dammit! I’d followed them on Google+, so I watched this video they shared, too.
- When my lovely wife told me that I might need to look into some mustache wax, I thought of Beardbrand. But I couldn’t remember their name! Lovely Wife Imminent need for mustache wax Who were those guys that made the cool beard stuff?
- So, naturally, I searched Google: Nope Nope Nope Yes!!
- A Perfect Match!
- Say goodbye bushy handlebars!
- This is How Content Marketing Really Works: Caveman Rand explain.
- This is How Content Marketing Really Works: Me Make Content. Humans click. If them like, them remember. Maybe them see more content I make. Visit again. Me build trust, relationship with humans. When them need me product, them come back.
- Content marketing not about convert 1st visit. Or 2nd. Or 3rd. Only foolish humans think it work like this.
- Content marketing about earning familiarity, trust, and relationship.
- Maybe sale come. Maybe not. Smart cave marketer no care. Smart cave marketer know every visit chance to build relationship. Maybe earn fan. That good enough.
- The Obligation Rests on Marketers to Set the Right Expectations with Our Teams & Clients
- You Made Content Without a Community #2
- Does Content Spread Simply Because It’s Really, Really Good? If this content’s really good, it’ll just spread “virally,” right guys?
- OK, Probably Not Ha! That’s a good one, Rand!
- In My Experience, Content Spreads Because It Inspires a Community
- It Reinforces a Belief
- Refutes an Opposing Argument
- Starts (or Renews) a Passionate Discussion
- Is in Someone’s Financial/Promotional Interests
- Leverages Group Inclusion Dynamics
- Makes the Sharer Look Smart/Important/Worldly/Etc
- “Good Enough” Content Often Performs Well when a Community Is Behind It
- Only the Best 0.1% of Content Can Go “Viral” without a Pre-existing Community
- Don’t Bet Your Marketing on Being the 1 in 1,000
- Before you create content, ask the question: “Who will support & amplify this content and why?”
- You Invested in Content Creation, But Not in its Amplification #3
- Content Must Reach People in Order to Reach Its Potential
- Channels for Reaching the Right People Depend on Your Audience
- Most Amplification Methods Fall Into These Three Buckets: Broadcast 1:1 Paid Promotion
- 1) Broadcast (often via Social Media, Email, or through Events)
- 2) 1:1 Outreach (via Social, Email, or In- Person)
- 3) Paid Amplification (many varieties)
- A basic process for getting content amplification right:
- STEP 1: Find Successful Content in Your Niche http://buzzsumo.com
- STEP 2(a): Find Where It’s Being Shared http://buzzsumo.com
- STEP 2(b): Go Beyond Social Networks Google Search and https://freshwebexplorer.moz.com/
- STEP 2(c): Find Who’s Doing the Sharing http://buzzsumo.com
- STEP 2(c): Find Who’s Doing the Sharing http://www.blindfiveyearold.com/ripples-bookmarklet
- STEP 3: Copy What’s Working For Them Via http://followerwonk.com
- STEP 4: Build Relationships with Those Who Can Help http://www.slideshare.net/RickTRamos1/why-do-peopleshareonline
- Don’t Treat Amplification as “Fire & Forget.” Experiment, Learn, & Apply. Via http://bit.ly
- Think of Each Channel Like a Muscle to Be Flexed & Strengthened Regularly
- Don’t Forget to Leverage What is Still the Most Powerful Sharing Channel (according to NYT): http://www.slideshare.net/RickTRamos1/why-do-peopleshareonline
- You Ignored Content’s Most Powerful Channel: SEO #4
- Google Search Has Grown Massively ~6 Billion Searches/Day! http://www.statisticbrain.com/google-searches/
- While Search Traffic is Distributed, Social is Highly Concentrated http://www.simplereach.com/blog/facebook-continues-to-be-the-biggest-driver-of-social-traffic/
- You Might Have Seen Re/Code & Buzzfeed Claiming Google Search Traffic Was Dead http://recode.net/2014/02/02/the-year-facebook-blew-past-google/
- Here’s Define Media Group refuting that with data from 87 publishers & 48 billion pageviews http://www.definemg.com/hey-buzzfeed-search-traffic-is-doing-just-fine/
- SEO is Also Critical Because of Intent “Do things” mode “Browse” mode
- When Done Right, Content Marketing is the Rising Tide that Lifts the SEO Ships Without content marketing, it’s incredibly hard to earn the types of links that will confer domain authority & rankings in search engines.
- When Done Right, Content Marketing is the Rising Tide that Lifts the SEO Ships But as content on a site earns links, it helps every other page on that domain rank better in the search engines, lifting the tide!
- At the Very Least, Do Your Keyword Research Be aware that AdWords may not show you all the relevant keywords: http://moz.com/blog/be-careful-using-adwords-for-keyword-research
- Better Yet – Gain a Deep Understanding of How SEO & Content Work Together 4 Posts I’d Recommend: • The Convergence of SEO & Content Marketing • Link Building vs. Content Marketing • How to Build a Content Marketing Strategy • Build & Operate a Content Marketing Machine
- You Gave Up Way Too Soon #5
- The 0.1% Always Look Like Overnight Successes
- In Reality, It Looks More Like This: Geraldine’s Travel Blog: http://everywhereist.com
- Geraldine started her blog in 2009 In Reality, It Looks More Like This:
- For 2 years, she never broke 100 visits/day. In Reality, It Looks More Like This:
- Then she had a few posts get some attention In Reality, It Looks More Like This:
- But traffic fell back down soon after. In Reality, It Looks More Like This:
- This is where most people give up. In Reality, It Looks More Like This:
- These days, she gets 100,000+ visits each month In Reality, It Looks More Like This:
- The Price of Success is Failure after Failure after Failure * Hopefully, each of those failures provides an opportunity to learn. *
- Why Content Marketing Fails Rand Fishkin, Wizard of Moz | @randfish | rand@moz.com bit.ly/mozcontentfail
Tags: b-learning, belo, desnvolvimento de software
A World of Terror
Posted by Armando Brito Mendes | Filed under estatística, materiais para profissionais, visualização
Exploring the reach, frequency and impact of terrorism around the world
The data used in this tool comes from the Global Terrorism Database, the most comprehensive collection of terrorism data available.
Tags: belo, Estat Descritiva
Big data: The next frontier for innovation
Posted by Armando Brito Mendes | Filed under materiais para profissionais
The amount of data in our world has been exploding, and analyzing large data sets—so-called big data—will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus, according to research by MGI and McKinsey’s Business Technology Office. Leaders in every sector will have to grapple with the implications of big data, not just a few data-oriented managers. The increasing volume and detail of information captured by enterprises, the rise of multimedia, social media, and the Internet of Things will fuel exponential growth in data for the foreseeable future.
Tags: big data, data mining, modelos empresariais
Vector maps on the web with Mapbox GL
Posted by Armando Brito Mendes | Filed under mapas SIG's, materiais para profissionais, software, visualização
Online mapping just got an upgrade:
Announcing Mapbox GL JS — a fast and powerful new system for web maps. Mapbox GL JS is a client-side renderer, so it uses JavaScript and WebGL to dynamically draw data with the speed and smoothness of a video game. Instead of fixing styles and zoom levels at the server level, Mapbox GL puts power in JavaScript, allowing for dynamic styling and freeform interactivity.
For the non-developers: Online maps are typically stored pre-made on a server, in the form of a bunch of image files that are stitched together when you zoom in and out of a map. So developers have to periodically update the image files if they want their base maps to change. It’s a hassle, which is why base maps often look similar. With Mapbox GL, making changes is easier because the development pipeline is shorter.
More details on the JavaScript library here.
Tags: desnvolvimento de software, mapas