Important Links
Problem Statement
Your challenge is to build a resume extractor that can automatically parse resumes in multiple formats (JPG, PDF, DOC/DOCX) and output the extracted details in a structured JSON format.
Input:
Resumes in different formats (scanned images, PDFs, Word documents).
Expected Output:
A sample JSON object with the following fields:{"name": "","email": "","phoneNumber": "","highSchoolName": "","highSchoolAddress": "","highSchoolGpaOrPercentage": "","highSchoolGpaScale": "","highSchoolBoard": "","highSchoolGraduationYear": "","ugCollegeName": "","ugCollegeAddress": "","ugCollegeGpaOrPercentage": "","ugCollegeGpaScale": "","ugUniversity": "","ugGraduationYear": "","ugDegree": "","ugMajor": "","pgCollegeName": "","pgCollegeAddress": "","pgCollegeGpaOrPercentage": "","pgCollegeGpaScale": "","pgUniversity": "","pgGraduationYear": "","pgDegree": "","pgMajor": "","certifications": [],"extraCurricularActivities": [],"workExperience": [],"researchPublications": [],"testScores": {"sat": "","act": "","gre": "","gmat": "","toefl": "","ielts": ""},"achievements": []}
Key Requirements:
- The agent must handle different resume formats and layouts (chronological, functional, hybrid).
- For image-based resumes (JPG, scanned PDFs), implement OCR (Optical Character Recognition) before extraction.
- Handle variations in how academic and career details are presented.
- Extract only relevant information, minimizing false positives.
- Output should be clean, consistent, and ready for downstream processing.
Evaluation Criteria:
- Accuracy – Correct extraction of details across different formats.
- Robustness – Ability to handle noisy data (scans, unconventional layouts).
- Completeness – Coverage of as many fields as possible from the schema.
- Scalability – Performance on bulk resumes.
- Automation – Minimal human intervention required.
- Round 1 -Team Registration & Domain Selection (PPT Saved as PDF to submit)
- Round 2 -Submission & Showcase (PPT Presentations)
Note: The hackathon will be conducted entirely in online mode.
Round 1 – Team Registration & Domain Selection
- Hackathon Submission Window: 15 Oct – 14 Nov 2025
- Round 1 Result : 18th November 2025
- Use the given PPT for your reference(Click Here)
- All team members must enroll and submit the Round 1 assignment in PPT saved as PDF format.
- All team members will receive e-certificates
Round 2 – Submission & Showcase
- Round 1 – Shortlisted teams only
- Submission Window: 19th November 2025– 5th December 2025
- Presentation Dates: From 10th December 2025
- Use the given PPT for your reference
- Submit your PPT must be saved in PDF format.
Round 2 - Results will be announced on December 17th 2025.!
