Question on validation dataset creation/agent pipeline. #1290

@mmrbulbul

For the Kaggle agent, during dataset preparation, we create train and test data by splitting the original train data. If I'm not mistaken, this newly created test data is then used in Step 4 of the pipeline:

Step 4 : Validation on Test Set or Kaggle 📉

  • Validate the newly developed model using the test set or Kaggle dataset.
  • Assess the model’s effectiveness and performance based on the validation results.

Given that creating the validation dataset is almost as important as building the model and requires an understanding of the data, shouldn't it also be part of the pipeline?
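
To make the concern concrete, here is a minimal sketch of the kind of split I mean. It is not the project's actual preparation code: the function name `split_train_holdout`, its parameters, and the plain random shuffle are all assumptions for illustration.

```python
import random


def split_train_holdout(rows, holdout_fraction=0.2, seed=0):
    """Shuffle rows and split them into (train, holdout) lists.

    Hypothetical helper: a plain random split, assumed here only to
    illustrate how a hold-out set is carved out of the original train data.
    """
    if not 0.0 < holdout_fraction < 1.0:
        raise ValueError("holdout_fraction must be between 0 and 1")

    # Shuffle indices with a fixed seed so the split is reproducible.
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)

    # The first n_holdout shuffled indices form the hold-out set.
    n_holdout = int(round(len(rows) * holdout_fraction))
    holdout_idx = set(indices[:n_holdout])

    # Preserve the original row order within each part.
    train = [r for i, r in enumerate(rows) if i not in holdout_idx]
    holdout = [r for i, r in enumerate(rows) if i in holdout_idx]
    return train, holdout
```

A purely random split like this ignores structure such as time ordering or grouped samples, which is why choosing the split seems to need the same understanding of the data as the modeling itself.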
