This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Last revision Both sides next revision | ||
bigdata [2019/09/05 17:19] mantis [Cloudera (has swallowed Hortonworks)] |
bigdata [2020/02/13 14:09] mantis [Ingesting into table] |
||
---|---|---|---|
Line 14: | Line 14: | ||
Implementation examples for [[https://dzone.com/articles/kafka-producer-and-consumer-example|Producers and Consumers]], [[https://dzone.com/articles/live-dashboard-using-apache-kafka-and-spring-webso|Kafka with Spring WebSockets]]. | Implementation examples for [[https://dzone.com/articles/kafka-producer-and-consumer-example|Producers and Consumers]], [[https://dzone.com/articles/live-dashboard-using-apache-kafka-and-spring-webso|Kafka with Spring WebSockets]]. | ||
+ | |||
+ | ====== Apache Storm ====== | ||
+ | |||
+ | [[https://storm.apache.org/|Storm]] | ||
+ | |||
====== Apache Flume ====== | ====== Apache Flume ====== | ||
Line 178: | Line 183: | ||
* use a small subset of your data | * use a small subset of your data | ||
* use --run-mode local | * use --run-mode local | ||
+ | * enable debugging in conf/log4j.properties <code>org.locationtech.geomesa.convert=debug </code> | ||
* set the error mode to 'raise-errors' in the converter options: https://www.geomesa.org/documentation/user/convert/parsing_and_validation.html#error-mode | * set the error mode to 'raise-errors' in the converter options: https://www.geomesa.org/documentation/user/convert/parsing_and_validation.html#error-mode | ||
Line 296: | Line 302: | ||
hdfs-audit.log files | hdfs-audit.log files | ||
+ | |||
+ | |||
+ | |||
+ | ====== Clouds ====== | ||
+ | |||
+ | https://aws.amazon.com/pricing/ | ||
+ | |||
+ | https://www.cloudera.com/products/pricing.html | ||
+ | |||
+ | https://azure.microsoft.com/en-us/pricing/ | ||
+ | |||
Line 309: | Line 326: | ||
[[https://www.computerworld.com/article/3428025/everything-you-need-to-know-about-the-new-cloudera-data-platform--vision--migration-and-roadmap.html|roadmap for merging HDP into Cloudera]] | [[https://www.computerworld.com/article/3428025/everything-you-need-to-know-about-the-new-cloudera-data-platform--vision--migration-and-roadmap.html|roadmap for merging HDP into Cloudera]] | ||
+ | |||
+ | ==== HDP Spark Developer DEV-343 ==== | ||
+ | |||
[[https://www.cloudera.com/about/training/courses/hdp-spark-developer.html#?classType=virtual|HDP Spark Developer DEV-343]] | [[https://www.cloudera.com/about/training/courses/hdp-spark-developer.html#?classType=virtual|HDP Spark Developer DEV-343]] | ||
+ | |||
+ | DataFrames/DataSets/RDD, shuffling, transformations and performance tuning, streaming | ||
* dataOps, analysts | * dataOps, analysts | ||
Line 316: | Line 338: | ||
* 3200 $ virtual classroom or | * 3200 $ virtual classroom or | ||
* 3500 € Paris | * 3500 € Paris | ||
+ | |||
+ | ==== HDP Data Science SCI-241 ==== | ||
[[https://www.cloudera.com/about/training/courses/applying-data-science-using-apache-hadoop.html|HDP Data Science SCI-241]] | [[https://www.cloudera.com/about/training/courses/applying-data-science-using-apache-hadoop.html|HDP Data Science SCI-241]] | ||
Line 325: | Line 349: | ||
* on request | * on request | ||
+ | ==== Cloudera Certified Associate (CCA) ==== | ||
+ | |||
+ | [[https://www.cloudera.com/about/training/certification.html|Cloudera Certified Associate (CCA)]] consists of 3 parts: | ||
+ | |||
+ | * [[https://university.cloudera.com/content/cca175|CCA Spark and Hadoop Developer exam]] | ||
+ | * [[https://www.cloudera.com/about/training/certification/cca-data-analyst.html|CCA Data Analyst exam]] | ||
+ | * [[https://www.cloudera.com/about/training/certification/cca-admin.html|CCA Administrator exam]] | ||
+ | |||
+ | each exam costs $295 | ||
[[https://www.cloudera.com/about/training/course-listing.html#?course=all|course locations]] | [[https://www.cloudera.com/about/training/course-listing.html#?course=all|course locations]] | ||
Line 332: | Line 365: | ||
https://www.coursera.org/courses?query=apache%20spark | https://www.coursera.org/courses?query=apache%20spark | ||
+ | ===== DataBricks ===== | ||
+ | https://academy.databricks.com/catalog | ||
+ | [[https://academy.databricks.com/instructor-led-training/DB301|DB 301 - Apache Spark™ for Machine Learning and Data Science]] | ||
+ | * 3 days | ||
+ | * virtual classroom | ||
+ | * $2500 | ||
+ | |||
+ | |||
+ | short, self-paced courses based on AWS or Azure: $75 | ||
+ | |||
+ | [[https://academy.databricks.com/category/certifications|certifications]] | ||
===== Dell ===== | ===== Dell ===== | ||
[[https://education.dellemc.com/index_downtime.htm|site down]] | [[https://education.dellemc.com/index_downtime.htm|site down]] | ||
+ | |||
+ | |||
+ | example: [[https://education.dellemc.com/content/dam/dell-emc/documents/en-us/E20_065_Advanced_Analytics_Specialist_Exam.pdf|Specialist -Data Scientist, Advanced Analytics Version 1.0]] | ||
+ | |||
+ | ===== Google ===== | ||
===== Heinlein ===== | ===== Heinlein ===== | ||
Line 355: | Line 404: | ||
[[https://www.coursera.org/specializations/advanced-data-science-ibm|Advanced Data Science with IBM Specialization]] | [[https://www.coursera.org/specializations/advanced-data-science-ibm|Advanced Data Science with IBM Specialization]] | ||
- | |||
- | |||
- | ===== DataBricks ===== | ||
- | |||
- | https://academy.databricks.com/catalog | ||
Line 368: | Line 412: | ||
[[https://mapr.com/training/certification/|The MapR Academy Certification Program is closed to new registration as we work to update the exams.]] | [[https://mapr.com/training/certification/|The MapR Academy Certification Program is closed to new registration as we work to update the exams.]] | ||
+ | |||
+ | ===== Microsoft Azure ===== | ||
+ | |||
+ | |||
===== SimpliLearn ===== | ===== SimpliLearn ===== | ||
Line 375: | Line 423: | ||
- | Aligned to Cloudera CCA175 certification exam | + | Aligned to Cloudera [[https://www.cloudera.com/about/training/certification/cca-spark.html|CCA175]] certification exam |
====== Meetups ====== | ====== Meetups ====== | ||
https://www.meetup.com/Wien-Cloud-Computing-Meetup/ | https://www.meetup.com/Wien-Cloud-Computing-Meetup/ | ||
- | https://viennadatasciencegroup.at/category/events/ | + | https://www.meetup.com/Big-Data-Vienna/ |
+ | |||
+ | https://www.meetup.com/Vienna-Data-Science-Group-Meetup/ | ||
https://www.ocg.at/cloud-computing-big-data | https://www.ocg.at/cloud-computing-big-data | ||