Aircraft Journey Summary Extractor

Convert aircraft journey summary forms (images) into structured JSON for downstream use.

Assumptions, Scope, and Constraints

Assumptions

Form Structure: Forms follow a consistent template with labeled fields (Aircraft Model, Registration Number, Departure/Arrival Airports, Crew, Fuel, Load, Defect Message)
Language: Text is primarily English with standard aviation abbreviations (FWD, GND, A/C, DD, MEL)
Image Quality: Photos are reasonably clear (smartphone quality or better)
Volume: Processing 2-3 forms per flight, with typical airline operations scale (<100K forms/month)
Internet Access: System has reliable internet connection for API calls

Scope

Extract structured data from 1-3 sample images
Handle both printed and handwritten text
Output valid JSON with required fields
Basic error handling for missing files and invalid images

Constraints

Not included: Advanced UI, production hardening, authentication
API Dependency: Requires Google API key and internet access
Accuracy: ~95% on clear handwriting, lower on illegible text
Processing Time: 2-3 seconds per form

Approach and Key Design Choices

Selected Approach: Google Gemini 2.0 Flash (Vision API)

Why this approach?

Single-step extraction: No need for separate OCR → parsing → structuring pipeline
Handles handwriting well: Better than traditional OCR (Tesseract) on cursive/messy text
Cost-effective: ~$0.001 per form vs ~$0.01 for GPT-4 Vision
Domain-aware: Can be prompted with aviation terminology for better accuracy

Alternatives Considered (and why rejected)

Approach	Why Rejected
Traditional OCR (Tesseract) + NLP	Poor handwriting recognition (~60% accuracy). Would require complex 4-stage pipeline: OCR → text cleanup → parsing → structuring
AWS Textract	Good for printed text but struggles with handwriting. Lacks domain context for aviation abbreviations. Higher cost (~$0.0015/form)
GPT-4 Vision	Excellent accuracy but estimated 10x more expensive (~$0.01/form). Overkill for this use case

Key Design Decisions

1. Prompt Engineering

Give model aviation domain expertise: "You are an expert aircraft maintenance technician"
Provide common abbreviations explicitly: FWD, GND, A/C, DD, MEL
Result: Improved handwriting accuracy from ~60% to ~95%

2. Error Handling

Graceful degradation: One failed form doesn't block batch processing
Clear error messages for missing files, invalid images, API failures

3. JSON Validation

Basic field presence checking
Type validation (crew/passengers as integers)
Clean markdown artifacts from API responses

JSON Output for Provided Sample Image

Output

{
  "aircraft_model": "Airbus A320",
  "registration_number": "9M-XX1",
  "departure_airport": "KUL",
  "arrival_airport": "SIN",
  "crew": "4",
  "passengers": null,
  "fuel": "12k",
  "load": "150",
  "defect_message": "VA.7 SDML DD 244161 ANY DD 244162 A/C USS FEED CONTROL AND A/C ESS FEED PRY SUO FOUL LT TO BE CHECKED OPERATIVE."
}

API Sketch

Request

POST /api/v1/extract
Content-Type: multipart/form-data

{
  "image": <file>
}

Parameters:

image (required): Image file (JPEG, PNG, WebP)

Response (Success)

{
  "status": "success",
  "data": {
    "aircraft_model": "Airbus A320",
    "registration_number": "9M-XX1",
    "departure_airport": "KUL",
    "arrival_airport": "SIN",
    "crew": 4,
    "passengers": null,
    "fuel": "12k",
    "load": "150",
    "defect_message": "..."
  }
}

Response (Error)

{
  "status": "error",
  "error_code": "INVALID_IMAGE",
  "message": "Unable to process image: file not found"
}

Project Structure

aircraft-journey-extractor/
├── README.md                          # This file
├── requirements.txt                   # Python dependencies
├── aircraft_extractor.ipynb           # Main implementation
├── images/
│   └── sample_image.jpeg             # Sample form
└── extraction_results.json            # Output

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aircraft Journey Summary Extractor

Assumptions, Scope, and Constraints

Assumptions

Scope

Constraints

Approach and Key Design Choices

Selected Approach: Google Gemini 2.0 Flash (Vision API)

Alternatives Considered (and why rejected)

Key Design Decisions

JSON Output for Provided Sample Image

Output

API Sketch

Request

Response (Success)

Response (Error)

Project Structure

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
image		image
README.md		README.md
aircraft_extractor.ipynb		aircraft_extractor.ipynb
extraction_results.json		extraction_results.json
requirements.txt		requirements.txt

razeeniqbal/aircraft-journey-summary-extractor

Folders and files

Latest commit

History

Repository files navigation

Aircraft Journey Summary Extractor

Assumptions, Scope, and Constraints

Assumptions

Scope

Constraints

Approach and Key Design Choices

Selected Approach: Google Gemini 2.0 Flash (Vision API)

Alternatives Considered (and why rejected)

Key Design Decisions

JSON Output for Provided Sample Image

Output

API Sketch

Request

Response (Success)

Response (Error)

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages