Free Hadoop Tutorial: Master BigData
Posted by Armando Brito Mendes | Filed under lições, materiais ensino, software
BigData is the latest buzzword in the IT Industry. Apache’s Hadoop is a leading Big Data platform used by IT giants Yahoo, Facebook & Google. This course is geared to make a Hadoop Expert.
What should I know?
This is an absolute beginner guide to Hadoop. But knowledge of 1) Java 2) Linux will help
Syllabus
Tutorial | Introduction to BIG DATA: Types, Characteristics & Benefits |
Tutorial | Hadoop Tutorial: Features, Components, Cluster & Topology |
Tutorial | Hadoop Setup Tutorial – Installation & Configuration |
Tutorial | HDFS Tutorial: Read & Write Commands using Java API |
Tutorial | What is MapReduce? How it Works – Hadoop MapReduce Tutorial |
Tutorial | Hadoop & Mapreduce Examples: Create your First Program |
Tutorial | Hadoop MapReduce Tutorial: Counters & Joins with Example |
Tutorial | What is Sqoop? What is FLUME – Hadoop Tutorial |
Tutorial | Sqoop vs Flume vs HDFS in Hadoop |
Tutorial | Create Your First FLUME Program – Beginner’s Tutorial |
Tutorial | Hadoop PIG Tutorial: Introduction, Installation & Example |
Tutorial | Learn OOZIE in 5 Minutes – Hadoop Tutorial |
Tutorial | Big Data Testing: Functional & Performance |
Tutorial | Hadoop & MapReduce Interview Questions & Answers |
Tags: big data
An Introduction to Implementing Neural Networks using TensorFlow
Posted by Armando Brito Mendes | Filed under materiais ensino, software
Uma boa introdução ao tensor flow e deep learning
Introduction
If you have been following Data Science / Machine Learning, you just can’t miss the buzz around Deep Learning and Neural Networks. Organizations are looking for people with Deep Learning skills wherever they can. From running competitions to open sourcing projects and paying big bonuses, people are trying every possible thing to tap into this limited pool of talent. Self driving engineers are being hunted by the big guns in automobile industry, as the industry stands on the brink of biggest disruption it faced in last few decades!
If you are excited by the prospects deep learning has to offer, but have not started your journey yet – I am here to enable it. Starting with this article, I will write a series of articles on deep learning covering the popular Deep Learning libraries and their hands-on implementation.
In this article, I will introduce TensorFlow to you. After reading this article you will be able to understand application of neural networks and use TensorFlow to solve a real life problem. This article will require you to know the basics of neural networks and have familiarity with programming. Although the code in this article is in python, I have focused on the concepts and stayed as language-agnostic as possible.
Let’s get started!
Table of Contents
- When to apply neural nets?
- General way to solve problems with Neural Networks
- Understanding Image data and popular libraries to solve it
- What is TensorFlow?
- A typical “flow” of TensorFlow
- Implementing MLP in TensorFlow
- Limitations of TensorFlow
- TensorFlow vs. other libraries
- Where to go from here?
Tags: big data, data mining, machine learning, text mining
curso de KNIME
Posted by Armando Brito Mendes | Filed under mapas SIG's, materiais para profissionais, software, videos, visualização
Muito bom curso de KNIME, é introdutório mas introduz um grande número de funcionalidades.
KNIME Online Self-Training
Welcome to the KNIME Self-training course. The focus of this document is to get you started with KNIME as quickly as possible and guide you through essential steps of advanced analytics with KNIME. Optional and very useful topics such as reporting, KNIME Server and database handling are also included to give you an idea of what else is possible with KNIME.
- Installing KNIME Analytics Platform and Extensions
- Data Import / Export and Database / Big Data
- ETL
- Visualization
- Advanced Analytics
- Reporting
- KNIME Server
Tags: análise de dados, big data, data mining, Knime, text mining
Os portugueses durante o euro com dados do multibanco
Posted by Armando Brito Mendes | Filed under estatística, visualização
Um bom exemplo da utilização de dados para inferir comportamentos mas a parte das coincidências de valores era dispensável
Como conquistámos o Euro 2016 através do Multibanco (com infografia)
Publicado em: 20/07/2016 – 19:11:26
À hora da final entre Portugal e França, o país parou… e os levantamentos também! Conheça esta e outras curiosidades que marcaram o comportamento dos portugueses com a rede Multibanco à medida que os 23 magníficos conquistavam o Europeu 2016
Guardar
Guardar
Tags: belo, big data, data mining, DW \ BI
Hackers Remotely Kill a Jeep on the Highway
Posted by Armando Brito Mendes | Filed under videos
Um exemplo dos problemas de segunrança ainda existentes no IoT.
Two hackers have developed a tool that can hijack a Jeep over the internet. WIRED senior writer Andy Greenberg takes the SUV for a spin on the highway while the hackers attack it from miles away.
Guardar
Tags: big data, data mining
Deeplearning4j Documentation
Posted by Armando Brito Mendes | Filed under materiais para profissionais, software
O site de um pacote java para deeplearing com montes de info. sobre redes neuronais e afins.
- How To
- Quickstart: Running Examples and DL4J in Your Projects
- Comprehensive Setup Guide
- Build Locally From Master
- Contribute to DL4J (Developer Guide)
- Choose a Neural Net
- Use the Maven Build Tool
- Vectorize Data With Canova
- Build a Data Pipeline
- Run Benchmarks
- Configure DL4J in Ivy, Gradle, SBT etc
- Find a DL4J Class or Method
- Save and Load Models
- Interpret Neural Net Output
- Visualize Data with t-SNE
- Swap CPUs for GPUs
- Customize an Image Pipeline
- Perform Regression With Neural Nets
- Troubleshoot Training & Select Network Hyperparameters
- Visualize, Monitor and Debug Network Learning
- Speed Up Spark With Native Binaries
- Build a Recommendation Engine With DL4J
- Use Recurrent Networks in DL4J
- Build Complex Network Architectures with Computation Graph
- Train Networks using Early Stopping
- Download Snapshots With Maven
- Customize a Loss Function
- Introduction to Neural Networks
- Multilayer Neural Nets
- Tutorials
- Datasets
- Scaleout
- Text
- Resources
- DL4J, Torch7, Theano and Caffe
- Glossary of Terms for Deep Learning and Neural Nets
- Deep Learning’s Accuracy
- DataVec: ETL for ML
- ND4J Backends: How They Work
- Model Zoo
- Unsupervised Learning: Use Cases
- Eigenvectors, PCA, Covariance and Entropy
- Thought Vectors, AI and NLP
- Questions to Ask When Applying DL
- AI, Machine Learning and Deep Learning
- DL and Reinforcement Learning
- Javadoc: DL4J Methods and Classes
- Canova Javadoc: Canova Methods and Classes
- ND4J User Guide
- ND4J Javadoc
- Scala, Spark and Deep Learning
- Further Reading on Deep Learning
- Deep Learning in Other Languages
- Use Cases
- Architecture
- Features
- Roadmap
- About
- Open Data
- Latest Release Notes
Guardar
Tags: análise de dados, big data, data mining, desnvolvimento de software, machine learning
Big Data to Fight Cancer
Posted by Armando Brito Mendes | Filed under estatística
Exemplo de aplicação de técnicas de análise de dados a problemas de medicina
MD Anderson is sitting on 23 petabytes of data, including more than 2 billion diagnostic radiology images, generated by its massive IT infrastructure. But Chris Belmont, vice president and CIO, isn’t intimidated by the amount of data—he’s just scared of staring at it too long.
“Our biggest fear when we decided to move into Big Data was that, like many healthcare organizations, we’d have a two-year data ‘ingestion’ process where we’d keep thinking about that massive set of data, and connect all our systems big and small together, go get even more data from external sources, and then eventually offer our users an add-on tool and tell them to go at it,” Belmont says. “By the time we’d be done ingesting all that data, the time to change the game in terms of costs or population health would have already passed.”
Tags: análise de dados, big data, data mining, decisão médica
UK Data Service
Posted by Armando Brito Mendes | Filed under data sets, estatística
Bom site com muitos data sets de grande dimensão. Assuntos relacionados com censos e inquéritos.
Explore the UK’s largest collection of social, economic and population data resources.
Data types
Tags: big data, data mining, inquéritos
Best Data Science Learning podcasts
Posted by Armando Brito Mendes | Filed under lições, materiais ensino, materiais para profissionais, videos
Muito bons podcasts tem temas introdutórios
We present the top 12 Data Science & Machine Learning related Podcasts by popularity on iTunes. Check out latest episodes to stay up-to-date & become a part of the data conversations!
By Bhavya Geethika Peddibhotla.
Learn Data science the new way by listening to these compelling story tellers, interviewers, educators and experts in the field. Data suggests that podcasting about Data Science is only growing!
Tags: análise de dados, big data, data mining, desnvolvimento de software, Estat Descritiva, machine learning
What is Data Virtualization?
Posted by Armando Brito Mendes | Filed under videos
Muito clara introdução ao tema da virtualização de dados.
What is Data Virtualization?
5 882
Tags: big data, data mining, DW \ BI