Streaming
| Skills | Description | Current Performance State |
|---|---|---|
| Airflow | - Task 1 : piped about 1128K of csv data daily - Task 2 : streamed and stored crawling data hourly. |
Tuning Parallel Processes |
| Spark | - Tested Job submit through pyspark in the spark clustering.
- Visualized in zeppelin. - Distributed processing |
Done |
| Web + Proxy Server | - Constructed load balancing between web servers and nginx - Edited shell script to automated crontab and updating frontend files for blog template |
Done |
| Ngrinder | - Experiments for 3 days : How efficient are the distributional computing - Environment of simple load balancing between web servers |
Done |
| DevOps + Jenkins | - Compose Docker files and glancing the Monitoring task, and building CI/CD between gitOpsand Jenkins | Done |
| Personal Project ver.1 | - Real Time Data Streaming - Through News and Stock Price data, Micro-Service FastAPI - Applied PageRank Algorithms for keywords counts |
Ready to build MLops |
| Analysis | Bert Modeling ( Transformer + classifier : Running ) , Vision | Implemented |
| Project Leader | H/W + Mask Recognition + Product manager | 2nd Awarded |
Juny
Hard-working, quick learner, enthusiastic and enjoys working both individually and in group. Pursuing the realistic and creative product to make !
Personal Project
https://github.com/Juny2312/Real-Time-Clustering