Features
Description
InfoShare Academy is a leading IT academy offering comprehensive educational programs in new technologies for companies. Since 2015, we have supported organizations in developing technology teams through dedicated courses in Machine Learning, DevOps, Data Engineering, Python, UX/UI Design, AWS, and Kubernetes. Our training is based on practical skills and real business cases. We collaborate with over 300 industry practitioners, ensuring that our programs are tailored to current market needs. We specialize in reskilling and upskilling employees. With us, you will build effective teams implementing new technologies that will accelerate innovation and strengthen your company's competitiveness in the market. Check out our training offerings designed to develop your employees' IT competencies.
The multitude of modules, versatile support, and ease of integration with web services make Python one of the most popular tools in the field of Data Science. Participants in this course will have the opportunity to learn about the two most essential libraries – NumPy and Pandas, and see their application in working with diverse data. The training requires knowledge of the basics of the Python language.
- For Python programmers looking to expand their skills in the field of Data Science
- For individuals developing towards work in machine learning and artificial intelligence
- For data analysts needing tools for implementing and automating their analyses and algorithms
- You will learn how to carry out the complete process from loading data, through transformations, to sharing results
- You will gain a thorough understanding of the basic element, which is multidimensional arrays in NumPy
- You will learn how to work with tabular data using the Pandas library
- You will discover how to load and integrate data from various sources and how to automate the analysis process
- You will see how to apply the acquired knowledge in further work towards machine learning
Module 1: Computational and Algorithmic Tools – NumPy and SciPy Libraries
Working with numerical data
Characteristics of multidimensional arrays in NumPy
Using libraries for scientific and engineering calculations
Module 2: Integration with Data Sources
Working with relational databases (MySQL, PostgreSQL)
Working with Excel sheets
Module 3: Basics of Working with Tabular Data – Pandas Library
Loading data from various sources
Structure of the Pandas DataFrame object
Built-in methods for performing typical analyses
Data operations and their automation
Data visualization using Matplotlib and Seaborn
Exporting results and demonstrating reporting tools
Module 4: Discussion of Next Steps Towards Machine Learning
Data cleaning and transformation process
Demonstration of using the above knowledge with Scikit-learn and TensorFlow/Keras libraries
16 h/2 days
- Certificate of completion
- Monthly access to the training recording (in case of online format)
- Customization of the training program to the client's needs