Portal de Informação Empresarial do IRN
Posted by Armando Brito Mendes | Filed under data sets
- Financial autonomy
- Indebtedness
- Net margin on sales (MLSV)
- Return on equity (ROE)
- Turnover
- Total employees
- Part-time employees
- Employees assigned to Research and Development
- Unpaid workers
- Total IES declarations
- Total IES declarations with annex A
- Total IES declarations with annex A1
- Total IES declarations with annex B
- Total IES declarations with annex B1
- Total IES declarations with annex C
- Total IES declarations with annex C1
- Associação na hora (on-the-spot association registration)
- Capital increases
- Certificates of admissibility for changes to a company
- Certificates of admissibility for company incorporation
- Empresa na hora (on-the-spot company incorporation)
- Empresa online (online company incorporation)
- Companies with international activity
- Companies with e-commerce
- Companies incorporated
- Companies dissolved
- Social charges
- Exports
- Imports
- Medium-sized enterprises
- Micro enterprises
- Small enterprises
- Remuneration
- Sucursal na hora (on-the-spot branch registration)
- Total companies
Tags: data analysis
Rede Hidrometeorológica dos Açores
Posted by Armando Brito Mendes | Filed under data sets, statistics, visualization
Repository of data sets
Posted by Armando Brito Mendes | Filed under data sets, statistics
Various bioinformatics datasets converted to ARFF by Jesús S. Aguilar-Ruiz and BioInformatics Group Seville (BIGS)
Datasets from the Data and Story Library, a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand…
Miscellaneous datasets from different sources. See also collections of data: UCI, StatLib, KDD Cup, PROMISE and others, located in separate…
Example implementations of algorithms that can be tested with TunedTester.
Resources of the data mining contest associated with IEEE International Conference on Data Mining (ICDM)
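Several of the collections above are distributed as ARFF files. As a rough illustration of what that format looks like, here is a minimal sketch of a reader for simple, dense ARFF text. The function name `parse_arff` and the toy `SAMPLE` data are assumptions made for this example and do not come from any of the listed repositories; quoted attribute names containing spaces and sparse rows are not handled.

```python
import csv
import io


def parse_arff(text):
    """Parse simple dense ARFF text into (attribute_names, rows).

    Illustrative only: ignores attribute types and does not support
    quoted attribute names with spaces or sparse data rows.
    """
    attributes = []
    data_lines = []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and '%' comment lines.
        if not line or line.startswith("%"):
            continue
        lower = line.lower()
        if in_data:
            data_lines.append(line)
        elif lower.startswith("@attribute"):
            # "@attribute <name> <type>": keep only the name.
            attributes.append(line.split(None, 2)[1])
        elif lower.startswith("@data"):
            in_data = True
    # Data rows are comma-separated, so the csv module can split them.
    reader = csv.reader(io.StringIO("\n".join(data_lines)), skipinitialspace=True)
    return attributes, list(reader)


# Hypothetical toy data, not taken from any listed collection.
SAMPLE = """% toy example
@relation weather
@attribute temperature numeric
@attribute play {yes,no}
@data
21.5,yes
30.0,no
"""

names, rows = parse_arff(SAMPLE)
print(names)
print(rows)
```

For real work a maintained reader is the safer choice; this sketch only shows the header-then-data layout of the format.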
Tags: data analysis, data mining
Stanford Large Network Dataset Collection
Posted by Armando Brito Mendes | Filed under ARS - SNA, data sets
- Social networks: online social networks, edges represent interactions between people
- Communication networks: email communication networks with edges representing communication
- Citation networks: nodes represent papers, edges represent citations
- Collaboration networks: nodes represent scientists, edges represent collaborations (co-authoring a paper)
- Web graphs: nodes represent webpages and edges are hyperlinks
- Amazon networks: nodes represent products and edges link commonly co-purchased products
- Internet networks: nodes represent computers and edges represent communication
- Road networks: nodes represent intersections and edges represent the roads connecting them
- Autonomous systems: graphs of the internet
- Signed networks: networks with positive and negative edges (friend/foe, trust/distrust)
- Wikipedia networks and metadata: talk, editing and voting data from Wikipedia
- Twitter and Memetracker: Memetracker phrases, links and 467 million tweets
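All of the categories above describe graphs as nodes joined by edges. A minimal sketch of loading such a graph follows, assuming a plain-text edge list in which lines starting with `#` are comments and every other line holds one whitespace-separated pair of node ids. That layout, the function name `read_edge_list`, and the toy `SAMPLE_LINES` are assumptions for illustration, so check the actual files before relying on them.

```python
def read_edge_list(lines):
    """Build a directed adjacency map {node: set(successors)} from edge lines."""
    adjacency = {}
    for line in lines:
        line = line.strip()
        # Skip blank lines and '#' comment lines (assumed header style).
        if not line or line.startswith("#"):
            continue
        source, target = line.split()[:2]
        adjacency.setdefault(source, set()).add(target)
        # Register the target too, so nodes without outgoing edges appear.
        adjacency.setdefault(target, set())
    return adjacency


# Hypothetical toy data, not taken from the collection.
SAMPLE_LINES = [
    "# Directed graph (toy example)",
    "# FromNodeId\tToNodeId",
    "0\t1",
    "0\t2",
    "1\t2",
]

graph = read_edge_list(SAMPLE_LINES)
out_degree = {node: len(successors) for node, successors in graph.items()}
print(out_degree)
```

Node ids are kept as strings here; converting them to integers is a one-line change if the data set uses numeric ids throughout.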
Tags: data analysis, ARS\SNA intro, graphs
Data Preservation Alliance for the Social Sciences (Data-PASS)
Posted by Armando Brito Mendes | Filed under data sets
The Data Preservation Alliance for the Social Sciences (Data-PASS) is a voluntary partnership of organizations created to archive, catalog and preserve data used for social science research. Examples of social science data include: opinion polls; voting records; surveys on family growth and income; social network data; government statistics and indices; and GIS data measuring human activity.
Tags: surveys
KDD Cup Center
Posted by Armando Brito Mendes | Filed under data sets
- KDD-Cup 2010 – Student performance evaluation
- KDD-Cup 2009 – Customer relationship prediction
- KDD-Cup 2008 – Breast cancer
- KDD-Cup 2007 – Consumer recommendations
- KDD-Cup 2006 – Pulmonary embolism detection from image data
- KDD-Cup 2005 – Internet user search query categorization
- KDD-Cup 2004 – Particle physics; plus protein homology prediction
- KDD-Cup 2003 – Network mining and usage log analysis
- KDD-Cup 2002 – BioMed document; plus gene role classification
- KDD-Cup 2001 – Molecular bioactivity; plus protein locale prediction
- KDD-Cup 2000 – Online retailer website clickstream analysis
- KDD-Cup 1999 – Computer network intrusion detection
- KDD-Cup 1998 – Direct marketing for profit optimization
- KDD-Cup 1997 – Direct marketing for lift curve optimization
Tags: knowledge capture, data mining
Time Series Data Library
Posted by Armando Brito Mendes | Filed under data sets, statistics
Subjects: Agriculture, Chemistry, Crime, Demography, Ecology, Finance, Health, Hydrology, Industry, Labour market, Links, Macro-Economic, Meteorology, Micro-Economic, Miscellaneous, Physics, Production, Sales, Simulated series, Sport, Transport and tourism, Tree-rings, Utilities
Tags: data mining, forecasting
Linear Regression Datasets
Posted by Armando Brito Mendes | Filed under data sets, statistics
Site with data for testing regression algorithms.
Tags: data mining
Google labs Books Ngram data sets
Posted by Armando Brito Mendes | Filed under data sets
Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set).
Datasets for Data Mining – Univ. Edinburgh
Posted by Armando Brito Mendes | Filed under data sets
Datasets for Data Mining
This page contains a list of datasets that were selected for the projects for Data Mining and Exploration. Students can choose one of these datasets to work on, or can propose data of their own choice. At the bottom of this page, you will find some examples of datasets which we judged as inappropriate for the projects.
Tags: data mining