This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
bigdata [2019/09/12 11:26] mantis [Cloudera Certified Associate (CCA)] |
bigdata [2020/07/13 13:39] (current) mantis |
||
---|---|---|---|
Line 14: | Line 14: | ||
Implementation examples for [[https://dzone.com/articles/kafka-producer-and-consumer-example|Producers and Consumers]], [[https://dzone.com/articles/live-dashboard-using-apache-kafka-and-spring-webso|Kafka with Spring WebSockets]]. | Implementation examples for [[https://dzone.com/articles/kafka-producer-and-consumer-example|Producers and Consumers]], [[https://dzone.com/articles/live-dashboard-using-apache-kafka-and-spring-webso|Kafka with Spring WebSockets]]. | ||
+ | |||
+ | ====== Apache Storm ====== | ||
+ | |||
+ | [[https://storm.apache.org/|Storm]] | ||
+ | |||
====== Apache Flume ====== | ====== Apache Flume ====== | ||
Line 162: | Line 167: | ||
[[https://developer.ibm.com/hadoop/2017/01/06/start-flume-agents-using-ambari-web-interface/|Flume in Ambari]] | [[https://developer.ibm.com/hadoop/2017/01/06/start-flume-agents-using-ambari-web-interface/|Flume in Ambari]] | ||
- | ====== Apache Storm ====== | + | |
Line 178: | Line 183: | ||
* use a small subset of your data | * use a small subset of your data | ||
* use --run-mode local | * use --run-mode local | ||
+ | * enable debugging in conf/log4j.properties <code>org.locationtech.geomesa.convert=debug </code> | ||
* set the error mode to 'raise-errors' in the converter options: https://www.geomesa.org/documentation/user/convert/parsing_and_validation.html#error-mode | * set the error mode to 'raise-errors' in the converter options: https://www.geomesa.org/documentation/user/convert/parsing_and_validation.html#error-mode | ||
Line 296: | Line 302: | ||
hdfs-audit.log files | hdfs-audit.log files | ||
+ | |||
+ | |||
+ | |||
+ | ====== Clouds ====== | ||
+ | |||
+ | https://aws.amazon.com/pricing/ | ||
+ | |||
+ | https://www.cloudera.com/products/pricing.html | ||
+ | |||
+ | https://azure.microsoft.com/en-us/pricing/ | ||
+ | |||
Line 314: | Line 331: | ||
[[https://www.cloudera.com/about/training/courses/hdp-spark-developer.html#?classType=virtual|HDP Spark Developer DEV-343]] | [[https://www.cloudera.com/about/training/courses/hdp-spark-developer.html#?classType=virtual|HDP Spark Developer DEV-343]] | ||
+ | |||
+ | DataFrames/DataSets/RDD, shuffling, transformations and performance tuning, streaming | ||
* dataOps, analysts | * dataOps, analysts | ||
Line 334: | Line 353: | ||
[[https://www.cloudera.com/about/training/certification.html|Cloudera Certified Associate (CCA)]] consists of 3 parts: | [[https://www.cloudera.com/about/training/certification.html|Cloudera Certified Associate (CCA)]] consists of 3 parts: | ||
- | [[https://university.cloudera.com/content/cca175|CCA Spark and Hadoop Developer exam]] | + | * [[https://university.cloudera.com/content/cca175|CCA Spark and Hadoop Developer exam]] |
- | + | * [[https://www.cloudera.com/about/training/certification/cca-data-analyst.html|CCA Data Analyst exam]] | |
- | [[https://www.cloudera.com/about/training/certification/cca-data-analyst.html|CCA Data Analyst exam]] | + | * [[https://www.cloudera.com/about/training/certification/cca-admin.html|CCA Administrator exam]] |
- | + | ||
- | [[https://www.cloudera.com/about/training/certification/cca-admin.html|CCA Administrator exam]] | + | |
each exam costs $295 | each exam costs $295 | ||
Line 352: | Line 369: | ||
https://academy.databricks.com/catalog | https://academy.databricks.com/catalog | ||
+ | [[https://academy.databricks.com/instructor-led-training/DB301|DB 301 - Apache Spark™ for Machine Learning and Data Science]] | ||
+ | * 3 days | ||
+ | * virtual classroom | ||
+ | * $2500 | ||
+ | short, self-paced courses based on AWS or Azure: $75 | ||
+ | [[https://academy.databricks.com/category/certifications|certifications]] | ||
===== Dell ===== | ===== Dell ===== | ||
Line 361: | Line 384: | ||
example: [[https://education.dellemc.com/content/dam/dell-emc/documents/en-us/E20_065_Advanced_Analytics_Specialist_Exam.pdf|Specialist -Data Scientist, Advanced Analytics Version 1.0]] | example: [[https://education.dellemc.com/content/dam/dell-emc/documents/en-us/E20_065_Advanced_Analytics_Specialist_Exam.pdf|Specialist -Data Scientist, Advanced Analytics Version 1.0]] | ||
+ | |||
+ | ===== Google ===== | ||
+ | |||
===== Heinlein ===== | ===== Heinlein ===== | ||
[[https://www.heinlein-support.de/schulung/big-data-mit-hadoop|Big Data mit Hadoop]] | [[https://www.heinlein-support.de/schulung/big-data-mit-hadoop|Big Data mit Hadoop]] | ||
Line 397: | Line 423: | ||
- | Aligned to Cloudera CCA175 certification exam | + | Aligned to Cloudera [[https://www.cloudera.com/about/training/certification/cca-spark.html|CCA175]] certification exam |
====== Meetups ====== | ====== Meetups ====== | ||
https://www.meetup.com/Wien-Cloud-Computing-Meetup/ | https://www.meetup.com/Wien-Cloud-Computing-Meetup/ | ||
- | https://viennadatasciencegroup.at/category/events/ | + | https://www.meetup.com/Big-Data-Vienna/ |
+ | |||
+ | https://www.meetup.com/Vienna-Data-Science-Group-Meetup/ | ||
https://www.ocg.at/cloud-computing-big-data | https://www.ocg.at/cloud-computing-big-data | ||