Deeplearning4j Documentation

clique na imagem para seguir o link

clique na imagem para seguir o link

O site de um pacote java para deeplearing com montes de info. sobre redes neuronais e afins.

Guardar

Tags: , , , ,

The Many Faces of ROC Analysis

clicar na imagem para seguir o link

clicar na imagem para seguir o link

Bom tutorial sobre curvas ROC

Receiver Operating Characteristics (ROC) Analysis originated from signal detection theory, as a model of how well a receiver is able to detect a signal in the presence of noise. Its key feature is the distinction between hit rate (or true positive rate) and false alarm rate (or false positive rate) as two separate performance measures. ROC analysis has also widely been used in medical data analysis to study the effect of varying the threshold on the numerical outcome of a diagnostic test. It has been introduced to machine learning relatively recently, in response to classification tasks with varying class distributions or misclassification costs (hereafter referred to as skew). ROC analysis is set to cause a paradigm shift in machine learning. Separating performance on classes is almost always a good idea from an analytical perspective. For instance, it can help us to

  • understand the behaviour and skew-sensitivity of many machine learning metrics, including rule learning heuristics and decision tree splitting criteria, by plotting their isometrics in ROC space;
  • develop new metrics specifically designed to improve the Area Under the ROC Curve (AUC) of a model;
  • understand fundamental algorithms such as the separate-and-conquer or sequential covering rule learning algorithm, by tracing its trajectory through a sequence of ROC spaces.

The goal of this tutorial is to develop the ROC perspective in a systematic way, demonstrating the many faces of ROC analysis in machine learning.

Tags: , ,

C Tutorial

clicar na imagem para seguir o link

clicar na imagem para seguir o link

Bom tutorial de C on-line.

Learn C with our popular C tutorial, which will take you from the very basics of C all the way through sophisticated topics like binary trees and data structures. By the way, if you’re on the fence about learning C or C++, I recommend going through the C++ tutorial instead as it is a more modern language.

Introduction and Basic C Features

Pointers, Arrays and Strings

File IO and command line arguments

Linked lists, binary trees, recursion

Finished with all these tutorials? Do some practice problems or view more tutorials.

why predicting a mass shooting is impossible

clicar na imagem para seguir o link

clicar na imagem para seguir o link

Excelente exemplo na previsão de acontecimentos raros

This cartoon explains why predicting a mass shooting is impossible

It’s simple math.

Updated by and on June 15, 2016, 10:30 a.m. ET

In the wake of mass shootings, many people wonder how they could have been prevented. Were there warning signs that should have been heeded? Was the person mentally ill? Did he or she hold extremist views?

The sad truth is that the only personal factors that reliably correlate with mass shooters are being young and male. There are a lot of young, angsty men in this country. That makes prediction hard.

Guardar

Tags: ,

Deconstructing Recommender Systems

clique na imagem para seguir o link

clique na imagem para seguir o link

uma boa descrição dos sistemas de recomendação usados em sites web

Deconstructing Recommender Systems

How Amazon and Netflix predict your preferences and prod you to purchase

By Joseph A. Konstan, John Riedl

Posted
sub map

Photo-illustration: Christina Beard

One morning in April, we each directed our browsers to Amazon.com’s website. Not only did the site greet us by name, the home page opened with a host of suggested purchases. It directed Joe to Barry Greenstein’s Ace on the River: An Advanced Poker Guide, Jonah Lehrer’s Imagine: How Creativity Works, and Michael Lewis’s Boomerang: Travels in the New Third World. For John it selected Dave Barry’s Only Travel Guide You’ll Ever Need, the spy novel Mission to Paris, by Alan Furst, and the banking exposé The Big Short: Inside the Doomsday Machine, also by Michael Lewis.

Guardar

Tags:

Portal de dados abertos da União Europeia

clique na imagem para seguir o link

clique na imagem para seguir o link

Boa fonte de dados para trabalhos

O que é o Portal de Dados?

Gostaria de aceder facilmente aos dados da UE? Pretende reutilizar os dados para, por exemplo, realizar um trabalho de investigação, redigir um artigo ou desenvolver uma aplicação ?

Então está no sítio certo. O Portal de Dados Abertos da UE serve de ponto de acesso único a um número crescente de dados produzidos pelas instituições e outros organismos da União Europeia.

Pode utilizar e reutilizar os dados, criar ligações para os mesmos ou redistribuí los para fins comerciais e não comerciais.

Tags: ,

Big Data to Fight Cancer

clique na imagem para seguir o link

clique na imagem para seguir o link

Exemplo de aplicação de técnicas de análise de dados a problemas de medicina

MD Anderson is sitting on 23 petabytes of data, including more than 2 billion diagnostic radiology images, generated by its massive IT infrastructure. But Chris Belmont, vice president and CIO, isn’t intimidated by the amount of data—he’s just scared of staring at it too long.

“Our biggest fear when we decided to move into Big Data was that, like many healthcare organizations, we’d have a two-year data ‘ingestion’ process where we’d keep thinking about that massive set of data, and connect all our systems big and small together, go get even more data from external sources, and then eventually offer our users an add-on tool and tell them to go at it,” Belmont says. “By the time we’d be done ingesting all that data, the time to change the game in terms of costs or population health would have already passed.”

Tags: , , ,

A Growth Hacker’s Journey

clicke na imagem para seguir o link

clicke na imagem para seguir o link

Uma história de vida muito interessante de um cientista dos dados \ hacker
Dr. Kirk Borne
Principal Data Scientist, Booz Allen Hamilton

Dr. Kirk Borne is a Principal Data Scientist at Booz Allen Hamilton. Previously he was a Professor of Astrophysics and Computational Science in the George Mason University School of Physics, Astronomy, and Computational Sciences. He was at Mason from 2003 to 2015, where he taught and advised students in the graduate and undergraduate Computational Science, Informatics, and Data Science programs. Before Mason, he spent nearly 20 years in positions supporting NASA projects, including an assignment as NASA’s Data Archive Project Scientist for the Hubble Space Telescope, and as Project Manager in NASA’s Space Science Data Operations Office. He has extensive experience in big data and data science, including expertise in scientific data mining and data systems. He has published over 200 articles (research papers, conference papers, and book chapters), and given over 200 invited talks at conferences and universities worldwide. In these roles, he focuses on achieving big discoveries from big data through data science, and he promotes the use of information and data-centric experiences with big data in the STEM education pipeline at all levels. He believes in data literacy for all! Learn more about him at http://kirkborne.net/. You can follow him on and on Twitter at @KirkDBorne, where he has been identified as one of the social network’s top big data influencers.

Tags: ,

UK Data Service

clique na imagem para seguir a ligação

clique na imagem para seguir a ligação

Bom site com muitos data sets de grande dimensão. Assuntos relacionados com censos e inquéritos.

Explore the UK’s largest collection of social, economic and population data resources.

See more

Video tutorials

See our growing range of training videos

Data types

See more

Tags: , ,

Best Data Science Learning podcasts

KDnuggets

Muito bons podcasts tem temas introdutórios

We present the top 12 Data Science & Machine Learning related Podcasts by popularity on iTunes. Check out latest episodes to stay up-to-date & become a part of the data conversations!

By Bhavya Geethika Peddibhotla.

Learn Data science the new way by listening to these compelling story tellers, interviewers, educators and experts in the field. Data suggests that podcasting about Data Science is only growing!

Tags: , , , , ,