Searching...
lunes, 24 de julio de 2023

Recursos Data Engineer

---------------------- ARTICULOS SPARK SCALA ---------------------- https://sparkbyexamples.com/spark/spark-setup-run-with-scala-intellij/

https://medium.com/@Sushil_Kumar/setting-up-spark-with-scala-development-environment-using-intellij-idea-b22644f73ef1

Instalar Java 8 en Ubuntu 
- https://computingforgeeks.com/how-to-install-java-8-on-ubuntu/ 

Instalar intellij-idea en Ubuntu 20.04 LTS 
- https://youtu.be/zCkozcDBJ-A 

Instalar plugin Scala y crear Proyecto en IntelliJ 

Mi primera aplicacion Apache Spark en Scala con Maven en IntelliJ 

Create Scala Project with Maven in IntelliJ IDEA


Spark Scala con Maven en IntelliJ 

Configurar Scala SDK Modificar fichero pom.xml con version de scala y dependencias spark - https://github.com/martinprobson/Spark-Scala-Maven-Example/blob/master/pom.xml 

Scala con Maven 
- https://docs.scala-lang.org/tutorials/scala-with-maven.html#the-scala-maven-plugin 

Scala on IntelliJ using Maven


Crear App Spark usando el IDE de IntelliJ en Ubuntu 
- https://youtu.be/AP5JvdgiBME 

Usar Gradle para crear una aplicación simple en Spark Scala 
- https://medium.com/@yanggao1119/using-gradle-to-create-a-simple-scala-spark-application-d5f1740e741b 

Crear de manera rapida un proyecto scala usando gradle en intellij 
- http://www.legendu.net/misc/blog/scala-with-gradle-in-intellij/ 

Iniciarse en Gradle con IntelliJ 
- https://www.jetbrains.com/help/idea/getting-started-with-gradle.html 

Generar un fichero JAR de una aplicación Scala con SBT 
- https://alvinalexander.com/scala/sbt-how-build-single-executable-jar-file-assembly/ 

 Como configurar y estructurar una aplicación Spark en Scala 


Create your first scala project with Maven - https://docs.scala-lang.org/tutorials/scala-with-maven.html

Create scala project with maven in IntelliJ Idea - https://www.javahelps.com/2021/03/create-scala-with-maven-project-in.html

Setting up a Spark development environment with Scala

Instalar Apache Spark y Scala

Setup Development Environment - Scala and Spark


Creating a Spark Standalone Cluster with Docker and Docker Compose


Learning Scala on MacBook with MacOS 12 (Monterey)


Setup homebrew on macOS Monterey


Install Java on Mac with brew 


Setup Scala Spark project in IntelliJ using SBT build tool

 
Kernel de Scala para Jupyter

Instalar Spark en Ubuntu

Spark setup with Scala and Run in IntelliJ

https://sparkbyexamples.com/spark/spark-setup-run-with-scala-intellij/?expand_article=1

——— LEER CONFIGURACIONES ——-

Configuration Library for JVM Languages - https://github.com/lightbend/config



Hoy to use the Lightbend Config library In a Scala of Java application - https://alvinalexander.com/scala/how-use-lightbend-config-library-scala-java-application/



 ----- IDEs --—
Working with Scala on IntelliJ 
Compile and build applications with IntelliJ 
Write and edit source code 

————- JAVA —————————————————————-
Java full course for beginners (using IntelliJ) - https://youtu.be/Qgl81fPcLc8

 ----------------------------- PROGRAMA DE FORMACION DEVOPS -----------------------------
 Git, GitHub and Gitlab - Curso de Atlassian en Coursera: Version Control with Git CI/CD Pipeline with JEnkins - Cursos Udemy: - Jenkins, from zero to hero. Become a DevOps Jenkins Master - GitLab CI: Pipelines, CI/CD and DevOps for Beginners Configuration Management with Chef, Puppet and Ansible - Ansible for the absolute beginner - Hands-on - DevOps - Curso Udemy: Dive into Ansible - From begginer to expert - Todos los cursos en kodekloud.com Docker Certified Associate Certification - Todos los cursos en kodekloud.com - Instalar Docker en Ubuntu: https://docs.docker.com/engine/install/ubuntu/ - Curso Docker y Devops: de novato a experto: https://www.udemy.com/share/102RT4BUQdeF5UQX4=/ - Desplegar una aplicación NodeJS con Docker: https://www.luigicolella.it/blog/how-to-create-nodejs-app-with-docker-1 - Dockerizar una App Scala: https://medium.com/@ievstrygul/dockerizing-scala-app-3fdf08cffda4 - dockerizar una aplicación Django - https://platzi.com/blog/django-docker/ - Como crear imagenes docker para Django - https://www.section.io/engineering-education/django-docker/ - Spark cluster with docker and docker-compose: https://github.com/fgarcialopez605/spark-essentials/tree/master/spark-cluster Certified Kubernetes Administrator - Todos los cursos en kodekloud.com Devops on AWS - Curso Udemy: AWS Certified DevOps Engineer (Requiere al menos el AWS Certified Developer) 

Apache Spark Cluster on Docker:

Spark clúster with docker


Apache Spark Standalone Cluster on Docker

https://github.com/cluster-apps-on-docker/spark-standalone-cluster-on-docker#tech-stack

 ---------------------------- PROGRAMA CERTIFICACION CLOUD ----------------------------

AWS Cloud Practitioner Certification 
- https://www.whizlabs.com/aws-certified-cloud-practitioner/

AWS Certified Cloud Practitioner

Official Google Cloud Certified Professional Data Engineer Study Guide

AWS Certified Solutions Architect

AWS Certified Solutions Architect Study Guide

Prepare for AWS Solution Architect Associate Certification


Como superar el examen de data Engineer de Google cloud - https://dbibyhavas.io/es/blog/data-engineer-de-google-cloud/

Desarrollo Python en Google Cloud 

Desarrollo Java en Google Cloud

—————————- PROGRAMA CERTIFICACIÓN SPARK ——————————-

Como aprobar la certificación Databricks Spark 3.0

 ------------------- MAQUINAS VIRTUALES ------------------- 
 Install Ubuntu 20.04 LTS on VirtualBox in Windows 10 - https://youtu.be/x5MhydijWmc - https://gadgetliv.com/install-ubuntu-on-virtualbox-in-windows/ 

How to install Ubuntu in VirtualBox on Windows 10


Como empezar a trabajar con un contenedor Docker de Cloudera QuickStart 

Proyecto Vagrant para arrancar una máquina virtual con Hadoop/Spark

 -------------------- DESARROLLO PYTHON --------------------
 Jupyter Notebooks, JupyterLab, Zeppelin, Anaconda, Spyder 
 
 Iniciar un proyecto de ciencia de datos con Anaconda en Python - http://spaceanalytics.blogspot.com/2020/11/iniciar-un-proyecto-de-ciencia-de-datos.html 

Empezar a trabajar con Apache Spark y Python

 
 Introducción práctica al analisis de datos en python 
 
Instalar un entorno de desarrollo Python3 en Ubuntu 20.04 
 
Creación de proyectos Python en IntelliJ 

Introducción a MongoDB, PySpark y Jupyter Notebook

Setup PySpark environment on Ubuntu

Setup JupyterLab Environment on Ubuntu

Install PySpark on Ubuntu



Install PySpark locally and use it with Jupyter Notebook

PySpark crash course in 50 minutes

PySpark Tutorial for Beginners

Instalar Anaconda Python en Ubuntu 20.04

Documentación oficial de Anaconda

JupyterLab in a virtual environment 

Use Virtual Environments Inside Jupyter Notebooks & JupyterLab

Create Django App on Ubuntu Server

Documentación oficial de Conda

Como instalar Python 3 y entornos virtuales con miniconda en Ubuntu

Gestionar múltiples versiones de Python en Ubuntu

 Otros ----- http://www.blogdelujo.com/2010/01/ver-incrustar-otra-web-en-blog.html

—————————DESARROLLO KAFKA ——————-

Desarrollo de aplicaciones Kafka con Scala - 


———— KUDU / IMPALA —————

Montar una infraestructura Docker que usa Impala para operar con Kudu

Oficial Kudu - https://kudu.apache.org/



Building real time BI systems with Kafka, Spark and Kudu - https://databricks.com/session/building-real-time-bi-systems-with-kafka-spark-and-kudu

Apache Kudu and Spark SQL for fast Analytics on fast data - https://databricks.com/session/apache-kudu-and-spark-sql-for-fast-analytics-on-fast-data

CERTIFICACIÓN GCP DATA ENGINEER


Usar Google Dataproc con PySpark y Jupyter Notebook


Run PySpark job on Dataproc


GENERAL DATA ENGINEER



Setting up a Dev Environment for Data Engineers


Data Architecture Articles


How to become a Data Architect


Top Data Architect and Data Engineer Certifications

0 comentarios:

Publicar un comentario

Gracias por participar en esta página.

 
Back to top!