User Tools

Site Tools


bigdata

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
bigdata [2019/09/05 17:21]
mantis [Cloudera (has swallowed Hortonworks)]
bigdata [2020/07/13 13:39] (current)
mantis
Line 14: Line 14:
  
 Implementation examples for [[https://​dzone.com/​articles/​kafka-producer-and-consumer-example|Producers and Consumers]],​ [[https://​dzone.com/​articles/​live-dashboard-using-apache-kafka-and-spring-webso|Kafka with Spring WebSockets]]. Implementation examples for [[https://​dzone.com/​articles/​kafka-producer-and-consumer-example|Producers and Consumers]],​ [[https://​dzone.com/​articles/​live-dashboard-using-apache-kafka-and-spring-webso|Kafka with Spring WebSockets]].
 +
 +====== Apache Storm ======
 +
 +[[https://​storm.apache.org/​|Storm]]
 +
  
 ====== Apache Flume ====== ====== Apache Flume ======
Line 162: Line 167:
  
 [[https://​developer.ibm.com/​hadoop/​2017/​01/​06/​start-flume-agents-using-ambari-web-interface/​|Flume in Ambari]] [[https://​developer.ibm.com/​hadoop/​2017/​01/​06/​start-flume-agents-using-ambari-web-interface/​|Flume in Ambari]]
-====== Apache Storm ======+
  
  
Line 178: Line 183:
   * use a small subset of your data   * use a small subset of your data
   * use  --run-mode local   * use  --run-mode local
 +  * enable debugging in conf/​log4j.properties <​code>​org.locationtech.geomesa.convert=debug </​code>​
   * set the error mode to '​raise-errors'​ in the converter options: https://​www.geomesa.org/​documentation/​user/​convert/​parsing_and_validation.html#​error-mode   * set the error mode to '​raise-errors'​ in the converter options: https://​www.geomesa.org/​documentation/​user/​convert/​parsing_and_validation.html#​error-mode
  
Line 296: Line 302:
  
 hdfs-audit.log files hdfs-audit.log files
 +
 +
 +
 +====== Clouds ======
 +
 +https://​aws.amazon.com/​pricing/​
 +
 +https://​www.cloudera.com/​products/​pricing.html
 +
 +https://​azure.microsoft.com/​en-us/​pricing/​
 +
  
  
Line 309: Line 326:
  
 [[https://​www.computerworld.com/​article/​3428025/​everything-you-need-to-know-about-the-new-cloudera-data-platform--vision--migration-and-roadmap.html|roadmap for merging HDP into Cloudera]] [[https://​www.computerworld.com/​article/​3428025/​everything-you-need-to-know-about-the-new-cloudera-data-platform--vision--migration-and-roadmap.html|roadmap for merging HDP into Cloudera]]
 +
 +==== HDP Spark Developer DEV-343 ====
 +
  
 [[https://​www.cloudera.com/​about/​training/​courses/​hdp-spark-developer.html#?​classType=virtual|HDP Spark Developer DEV-343]] [[https://​www.cloudera.com/​about/​training/​courses/​hdp-spark-developer.html#?​classType=virtual|HDP Spark Developer DEV-343]]
 +
 +DataFrames/​DataSets/​RDD,​ shuffling, transformations and performance tuning, streaming
  
   * dataOps, analysts   * dataOps, analysts
Line 316: Line 338:
   * 3200 $ virtual classroom or   * 3200 $ virtual classroom or
   * 3500 € Paris   * 3500 € Paris
 +
 +==== HDP Data Science SCI-241 ====
  
 [[https://​www.cloudera.com/​about/​training/​courses/​applying-data-science-using-apache-hadoop.html|HDP Data Science SCI-241]] [[https://​www.cloudera.com/​about/​training/​courses/​applying-data-science-using-apache-hadoop.html|HDP Data Science SCI-241]]
Line 325: Line 349:
   * on request   * on request
  
-[[https://​university.cloudera.com/​content/​cca175|CCA Spark and Hadoop Developer ​(CCA175)]]+==== Cloudera Certified Associate ​(CCA ====
  
-$ 295.00+[[https://​www.cloudera.com/​about/​training/​certification.html|Cloudera Certified Associate (CCA)]] consists of 3 parts: 
 + 
 +  * [[https://​university.cloudera.com/​content/​cca175|CCA Spark and Hadoop Developer exam]] 
 +  * [[https://​www.cloudera.com/​about/​training/​certification/​cca-data-analyst.html|CCA Data Analyst exam]] 
 +  * [[https://​www.cloudera.com/​about/​training/​certification/​cca-admin.html|CCA Administrator exam]] 
 + 
 +each exam costs $295
  
 [[https://​www.cloudera.com/​about/​training/​course-listing.html#?​course=all|course locations]] [[https://​www.cloudera.com/​about/​training/​course-listing.html#?​course=all|course locations]]
Line 335: Line 365:
 https://​www.coursera.org/​courses?​query=apache%20spark https://​www.coursera.org/​courses?​query=apache%20spark
  
 +===== DataBricks =====
  
 +https://​academy.databricks.com/​catalog
  
 +[[https://​academy.databricks.com/​instructor-led-training/​DB301|DB 301 - Apache Spark™ for Machine Learning and Data Science]] ​
 +  * 3 days
 +  * virtual classroom
 +  * $2500
 +
 +
 +short, self-paced courses based on AWS or Azure: $75
 +
 +[[https://​academy.databricks.com/​category/​certifications|certifications]]
 ===== Dell ===== ===== Dell =====
  
 [[https://​education.dellemc.com/​index_downtime.htm|site down]] [[https://​education.dellemc.com/​index_downtime.htm|site down]]
 +
 +
 +example: [[https://​education.dellemc.com/​content/​dam/​dell-emc/​documents/​en-us/​E20_065_Advanced_Analytics_Specialist_Exam.pdf|Specialist -Data Scientist, Advanced Analytics Version 1.0]]
 +
 +===== Google =====
  
 ===== Heinlein ===== ===== Heinlein =====
Line 358: Line 404:
  
 [[https://​www.coursera.org/​specializations/​advanced-data-science-ibm|Advanced Data Science with IBM Specialization]] [[https://​www.coursera.org/​specializations/​advanced-data-science-ibm|Advanced Data Science with IBM Specialization]]
- 
- 
-===== DataBricks ===== 
- 
-https://​academy.databricks.com/​catalog 
  
  
Line 371: Line 412:
  
 [[https://​mapr.com/​training/​certification/​|The MapR Academy Certification Program is closed to new registration as we work to update the exams.]] [[https://​mapr.com/​training/​certification/​|The MapR Academy Certification Program is closed to new registration as we work to update the exams.]]
 +
 +===== Microsoft Azure =====
 +
 +
  
 ===== SimpliLearn ===== ===== SimpliLearn =====
Line 378: Line 423:
  
  
-Aligned to Cloudera CCA175 certification exam+Aligned to Cloudera ​[[https://​www.cloudera.com/​about/​training/​certification/​cca-spark.html|CCA175]] certification exam
 ====== Meetups ====== ====== Meetups ======
  
 https://​www.meetup.com/​Wien-Cloud-Computing-Meetup/​ https://​www.meetup.com/​Wien-Cloud-Computing-Meetup/​
  
-https://viennadatasciencegroup.at/category/events/+https://www.meetup.com/Big-Data-Vienna/​ 
 + 
 +https://​www.meetup.com/Vienna-Data-Science-Group-Meetup/
  
 https://​www.ocg.at/​cloud-computing-big-data https://​www.ocg.at/​cloud-computing-big-data
  
bigdata.1567696900.txt.gz · Last modified: 2019/09/05 17:21 by mantis