A Workshop on Feature Engineering/Feature Selection and Assemble Approaches in Machine Learning

Date

October 26, 2024

Location

Virtual Meet

Process Point : Online Faculty Development Program

Introduction

The recent workshop conducted by Shri Vishnu Engineering College for Women, aimed at faculty and students from universities across India, provided an in-depth exploration of essential concepts and practical applications in Data Science and Machine Learning. Our Head of Data Science participated in this workshop, covering a wide range of topics, from feature engineering to ensemble learning. The sessions aimed to bridge the gap between human intuition and machine understanding. Below is a detailed overview of the workshop sessions and their key takeaways.

Session Breakdown

Session 1: The Human-Machine Gap

  • The Machine Learning Challenge: Discussed the complexities involved in enabling machines to interpret human language and descriptions.
  • Understanding Descriptions: Explored how machines can comprehend and process human language.
  • Key Insight: Introduced the concept of feature engineering as a critical challenge in machine learning.

Session 2: Feature Engineering & Ensemble Learning

  • Feature Engineering: Defined as the process of selecting and transforming variables when creating a predictive model.
  • Ensemble Learning: Introduced as a technique that combines multiple models to improve prediction accuracy.
  • Example Exercise: Provided hands-on experience with real-world data.

Session 3: Human vs Machine Thinking

  • Translating Intuition to Features: Discussed how human intuition can be transformed into machine-readable features.
  • Common Pitfalls: Identified common mistakes in feature engineering that can hinder model performance.

Session 4: Feature Engineering Techniques

  • Temporal Feature Engineering: Covered techniques for handling time-based data with practical examples.
  • Numerical Feature Engineering: Discussed scaling techniques to normalize numerical data.
  • Categorical Feature Engineering: Explored methods for converting categorical variables into numerical formats.
  • Practice Exercise: Engaged participants in applying learned techniques.

Session 5: Feature Selection Methods

  • Filter Methods: Explained methods that evaluate features based on their statistical properties with real-world applications.
  • Wrapper Methods: Introduced methods that consider the performance of a model using subsets of features.
  • Embedded Methods: Focused on LASSO and Ridge Regression for feature selection.
  • Interactive Exercise: Participants engaged in hands-on feature selection activities.
  • Best Practices: Shared do’s and don’ts for effective feature selection.

Session 6: Ensemble Learning

  • Combining Models for Better Predictions:
    • Bagging (Bootstrap Aggregating): Techniques to reduce variance by training multiple models.
    • Boosting (Learning from Mistakes): Focused on improving model accuracy by sequentially training models.
    • Stacking (Meta-Learning): Combining predictions from multiple models to enhance overall performance.
  • Hands-on Exercise: Participants built their own ensemble models.
  • Best Practices: Discussed when to use each ensemble method effectively.

Conclusion

The workshop successfully equipped participants with foundational knowledge and practical skills necessary for navigating the complexities of data science and machine learning. By emphasizing feature engineering, model evaluation, and real-world applications, attendees gained valuable insights that will aid them in their academic and professional pursuits. As the demand for expertise in these fields continues to grow, this workshop served as an essential stepping stone for faculty and students alike, fostering a deeper understanding of how to leverage data science and machine learning effectively. We look forward to seeing how participants apply these insights in their future projects!

Feature Engineering Presentation

Featured Speakers

Sydney Lewis

Head Of Data Science
Process Point Technologies

Dr. D. Venkata Naga Raju

Professor & HOD
Department of Information Technology

Gallery

Key Metrics for AI-Driven Success

Process Point leverages cutting-edge AI/ML technologies to enhance operational efficiency in process industries. Our solutions deliver measurable results, ensuring high satisfaction rates, robust attendance at our sessions, and comprehensive planning for success.

Satisfaction Rate

99.5%

Expected Attendees

200+

Sessions Planned

6