title of presentation - imdr-+roussel... · 2019-02-27 · ecam : cond lav+galley fan fault …...

21
Aircraft maintenance logbooks analysis David ROUSSEL, Airbus Group Innovations, Computational Intelligence & Services Fabrice LATCHURIE, Airbus, In-Service Data Management Des méthodes aux applications du TAL dans le retour d’expérience Journée IMdR du 16 mai 2017

Upload: others

Post on 18-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Aircraft maintenance logbooks analysis

David ROUSSEL, Airbus Group Innovations, Computational Intelligence & Services

Fabrice LATCHURIE, Airbus, In-Service Data Management

Des méthodes aux applications du TAL dans le retour d’expérience

Journée IMdR du 16 mai 2017

Summary

• Overview of Aircraft In-Service data management

• What are Aircraft Logbook coding tasks ?

• Logbook auto-coding investigations with Machine Learning

• Assisted coding industrial implementation with dictionaries and rules

• Lesson learnt and Perspectives

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 2

In service data management & coding

Benchmark, identify

weaknesses

solutions Improve the

fleet

In-service data

Change considered

Text Mining and Statistical Modules plugged into the in-service

data coding processes

Impact

Accuracy of codes and results

Proposed solutions

Benefits

Reduce time to process the in-service data

in the context of Airbus fleet growth

Increase quality of the in-service data processing

Airlines

logbooks

coding

End-to-end approach to address multi program

in-service issues

Objective is to efficiently drive the Fleet Performance Improvement, based on:

250 operators reporting data for

22,000 flights per day for an average of

9000 interruptions per month (delays, flight cancellation, in-flight turn back, diversion)

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 3

identify, develop and validate data modules for modifications

Deliver contextualized documentation

Manage/maintain documentation & preconditions

Maintenance

Planning

Maintenance

Execution

Maintenance Monitor and

Control

Technical

requests

Instant Monitoring

and advices

In Service Technical Event

Management Post Design Services

Technical

Publications

Maintenance logbooks (from

MPD, Pirep, Carep, Marep…)

Reliability Monitoring

Unscheduled

Maintenance

Aircraft reports

(from CMS, ACMS, …)

Processes overview

What are Aircraft Logbook coding tasks ?

• Manual coding of these feedbacks has been performed by inspecting

textual fields which are providing a “description” and “work performed”.

• These text fields are unstructured, can contain abbreviations, spelling and

grammatical mistakes (of non native speakers), etc.

After data collection, “coding” means enriching the raw text data by fields

like:

ATA chapters (based on ATA-100 standard)

Custom codes (Maintenance action, Finding, Symptoms/Faults,

Operational Phase, Effects)

Related events (e.g. ECAM message)

LAVATORY GALLEY FAN FAULT.AFT

GALLEY FAN RPLCD.

From unstructured data (free text)

To structured data

ATA 6D: 21-23-51

ACTION: Replaced

ECAM : COND LAV+GALLEY FAN FAULT

… Other TAGS

Coding

Enrichment with related event

Categorization (hierarchical)

Unique or multiple tagging

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 5

Auto-coding investigations with Machine Learning

Textual pre-

processing

Auto-Classification

models learning Historical coded Data

Coding

(with confidence)

Assisted-Coding for

confident cases

(rules)

Cross-evaluation

- 16/05/2017 6

New data

Most representative

contributors

ATA codes

(e.g. one year A320)

Fault complex codes

representation

(e.g. small airline data)

MAIN CHALLENGES:

• Code values are unbalanced

• Non-native written Technical English

• Complex codes (combinations) are under-represented

• Coding practices may vary between coders, are subject to errors

• Empty code values are difficult to interpret (missing codes 3.3 %)

Des méthodes aux applications du TAL dans le retour d’expérience

ATA-4D by Airline

Action-Finding

Effect

Fault

Rule-based fusion

Classifiers

Effect-Phase

Coupled codes

Machine Learning approaches tested

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 7

Coded field ATA–4D

by Airline

(660

codes)

Action

(15)

Finding

(38 )

Fault

(30)

Phase

(15)

Effects

(16)

% Success Overall 82%

85% 73% 70% 70% 62%

% Success with high

confidence

55%

13 % 36% 19%

No improvement expected

Good hopes for improvement

% Success Overall – refers to the % of correctly coded

cases in the sample

(but where the errors are is unknown)

% success with high confidence means the sub-set of

coded cases which are closed to expert coding

(success > 95 %)

CONCLUSION was to investigate dictionary-based approach + business rules to provide suggestions

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 8

First results obtained

Defining a dictionary and elucidating business rules

A dictionary is intuitively more relevant for multiple tagging (combinations)

and ATA chapters. It gives rise to better description of the codes.

It enables refinement of coding schemes (e.g. ATA 4 digits ATA 6 digits)

by supporting IHM improvements for codes selection.

Unique code selection is performed by explicit prioritization rules

(e.g. replacement action > inspection, 291151 Ata chapter > 291100, …)

Contextual rules are used to detect positive or negative textual contexts

(e.g. for disambiguation, syntactic variants, negation, mentions of postponed action...)

To ease dictionary creation, substitutions are used to normalize the raw text

(e.g. RPLCD REPLACED).

Fuzzy matching is used for compound terms and mentions to Aircraft messages.

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 9

Illustration of lexical ambiguity between actions

Adjusted and Serviced Current status: 6500 substitutions, 9500 rules

290000 HYDRAULIC + LEAK

291100 GREEN + EMP

291100 GREEN + ENGINE 1

291100 GREEN + HYDRAULIC + SYSTEM

291100 GREEN + PRESSURE

291100 ENGINE 1 + HYDRAULIC

291114 GREEN + MANIFOLD

291114 1011GM

291114 1111GM

291115 GREEN + PTU + MANIFOLD

291115 1013GM

291115 1113GM

291117 GREEN + PRESSURE + SWITCH

291117 SWITCH + PUMP

291117 1074GK

291117 1074GK

291121 GREEN + AIR + ACCUMULATOR

291121 1072GM

291122 GREEN + ACCUMULATOR + CHARGE + VALVE

291122 1071GM

291132 GREEN + PRESSURE + RELIEF + VALVE

291132 1063GM

291133 GREEN + PRIORITY + VALVE

291133 1064GM

291134 GREEN + SAMPLING + VALVE

291134 1187GM

291135 GREEN + CHECK + VALVE + REVERSER

291135 3008KM1

291135 3008KM1

291135 3009KM1

291135 3010KM1

291136 GREEN + CHECK + VALVE + PUMP + DELIVERY

291136 1050GM

291136 1060GM

291137 GREEN + CASE + DRAIN

291137 1041GM

291138 GREEN + COUPLING + ACCUMULATOR

291138 1672GM

291139 GREEN + HALF + COUPLING

291139 1038GM

291139 1700GM

291139 1700GM

291141 GREEN + LOW + AIR + PRESSURE

291141 GREEN + RESERVOIR

291141 1000GQ

291142 GREEN + ACCUMULATOR

291142 1070GM

291143 EDP + FILTER + ENGINE 1

291143 EDP + FILTER + LEFT

291143 EDP + FILTER + GREEN

291143 1084GM

291143 1086GM

291144 GREEN + FILTER + LOW + PRESSURE

291144 1002GM

291144 1030GM

291145 GREEN + FILTER + HIGH + PRESSURE

291145 1048GM

291146 GREEN + DAMPENER

291146 1085GM

291147 GREEN + SLIDE + COMPENSATOR

291147 1010GM

291148 GREEN + CHECK + VALVE + WTB

291148 1170GM

291148 1410GM

291148 1411GM

291149 GREEN + HYDRAULIC + ENGINE + TUBE

291149 HYDRAULIC + ENGINE 1 + TUBE

291151 EDP + ENGINE 1

291151 EDP + LEFT

291151 ENGINE 1 + PUMP

291151 GREEN + EDP

291151 GREEN + PUMP

291151 1030GK

291152 GREEN + ENGINE + FIRE + VALVE

291152 1046GK

291153 GREEN + DAMPER + EDP

291153 1600GM

291163 GREEN + GROUND + MANIFOLD

291200 BLUE + HYDRAULIC + SYSTEM

291200 BLUE + PRESSURE

291214 BLUE + MANIFOLD

291214 2011GM

291215 BLUE + ELEC + PUMP

291215 2706GJ

291217 BLUE + PRESSURE SWITCH

HYDRAULIC LK ON ENG#1 EDP,

RPLCD #1 HYD PMP OPS AND LK CK

GOOD

Text normalization Matching Ranked suggestions

From

Abbreviation and most current

misspelling errors

AVIONCS

AVCS AVIONIC

AVNCS AVNX

AVNXS

TO

Normalized text

AVIONICS

HYDRAULIC LEAK ON ENGINE 1 EDP

REPLACED 1 HYDRAULIC PUMP OPS

AND LEAK CHECK GOOD

HYDRAULIC LEAK ON ENGINE 1 EDP

REPLACED 1 HYDRAULIC PUMP OPS

AND LEAK CHECK GOOD

HYDRAULIC LEAK ON ENGINE 1 EDP

REPLACED 1 HYDRAULIC PUMP OPS

AND LEAK CHECK GOOD

HYDRAULIC LEAK ON ENGINE 1 EDP

REPLACED 1 HYDRAULIC PUMP OPS

AND LEAK CHECK GOOD

HYDRAULIC LEAK ON ENGINE 1 EDP

REPLACED 1 HYDRAULIC PUMP OPS

AND LEAK CHECK GOOD

HYDRAULIC LEAK ON ENGINE 1 EDP

REPLACED #1 HYDRAULIC PUMP OPS

AND LEAK CHECK GOOD

290000 HYDRAULIC + LEAK

291100 ENGINE 1 + HYDRAULIC

EDP + ENGINE 1

ENGINE 1 + PUMP

Human decision making

Assisted coding process

291151

- 16/05/2017 10

IHM of assisted coding

Related warnings are

ranked by fuzzy

matching score

Text is highlighted regarding

the selected coding field

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 11

Dictionary + rules modeling and test

match

3A - HYDRAULIC LK ON ENGINE 1

EDP TEST.

REPLACED #1 HYD PUMP OPS AND

LK CK GOOD

291151

Text to be analysed Principle:

If a keyword or a combination

of keywords is discriminative

in several coded examples,

then a lexical item or rule is

added for the corresponding

code.

Dictionary creation is

compliant with PROVALIS

exchange format to benefit

from the rich set of Text

Mining methods implemented

in WordStat tool, for case

studies.

ATA Code candidates

291100

@ENG1&HYD [##ENGINE_1 NEAR HYD* /Y /C5]

@GREEN&EMP [##GREEN NEAR ##EMP /Y /C5]

@GREEN&ENG1 [##GREEN NEAR ##ENGINE_1 /Y /C5]

@GREEN&HYD [HYD* NEAR ##GREEN /Y /C5]

With

##ENGINE_1 = [ENGINE BEFORE 1 /Y /C2]

##GREEN = G or GREEN

##EDP = EDP or [ENG* NEAR DRIV* NEAR PUMP* /A]

291100

291151

@EDP&LEFT [##EDP NEAR LEFT /Y /C7]

@GREEN&EDP [##GREEN AND ##EDP /Y /C]

@EDP&ENG1 [##EDP AND ##ENGINE_1 /Y /C]

@ENG1&PMP [##ENGINE_1 AND ##PUMP /Y /S]

1030GK

match

6D

4D

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 12

Fast Dictionary builder

Rules and keywords

(already defined) Existing coding selection

(here ATA)

REQUIREMENT #1:

ease observation

and rules creation

Relevant keywords related

to ATA 291151 based on

TF-IDF TF-ICF analysis

Terms co-occurrence

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 13

Lesson learnt & Perspectives

• First experiments to analyze logbooks through automated coding have shown that Machine Learning scores are

difficult to exploit (mainly due to existing coding errors, inconsistencies and code representativeness in the training datasets).

• Machine learning models also learn “bad” coding habits.

• Rules based on a lexical approach have been implemented to provide suggestions to “human” coders and improve quality.

Rules are managed to implement relevancy, redundancy, accuracy principles.

Based on coding activity traces, individually, suggestions are 85 to 95% accurate with good hopes for improvement.

• Next improvements would be:

• To link and visualize logbooks data in a context of past and next maintenance events, to deliver better results.

• To learn scoring models to adapt ranking mechanisms according human decision makings

• To model the codes through analogy with their prototypical examples (human categorization)

It is now possible to generate positive/negative pairs of a priori semantically similar texts to test Deep Learning disambiguation

(e.g. Siamese Recurrent Architectures).

- 16/05/2017 Des méthodes aux applications du TAL dans le retour d’expérience 14

© Airbus 2017. All rights reserved. Proprietary document. This document and all information contained herein is the sole property of AIRBUS. No intellectual property rights are granted by the

delivery of this document or the disclosure of its content. This document shall not be reproduced or disclosed to a third party without the express written consent of AIRBUS. This document and

its content shall not be used for any purpose other than that for which it is supplied. The statements made herein do not constitute an offer. They are based on the mentioned assumptions and

are expressed in good faith. Where the supporting grounds for these statements are not shown, AIRBUS will be pleased to explain the basis thereof.

AIRBUS, its logo, A300, A310, A318, A319, A320, A321, A330, A340, A350, A380, A400M are registered trademarks.

Appendix – Past Coding HMI

Use Tab 'Insert - Header & Footer' for Presentation Title - Siglum - Reference

Appendix – Coding detail

Name Description

AC Aircraft NA Not Aircraft NM Maintenance

NO Operation

NS Lack of spares

Use Tab 'Insert - Header & Footer' for Presentation Title - Siglum - Reference

Chargeability

SYMPTOM/FAULT codes

Effect codes

Name Description

AA Aborted approach

AB Abandoned aircraft AT Aborted take off BL Belly landing, ditching

CH Aircraft changed

CN Cancellation

DP Depressurisation

DV Diversion

DY Delay

ED Emergency descent ES Engine shut down in flight ET Etops incident GI Ground interruption

IF Interrupted flight JT Jettison

TL Tail strike

Name Description

AS APU auto shut down without crew action

BA Bomb alert BF Buffering

BS Bird strike

DF Handling difficulties, refuel inop

DI Disconnecting, disengaging

EI Erractic indication

EM Emergency system failure

EX Parameter limit exceedance

FE Fire extinguisher activated

FF Fatigue failure

FI Fire (real or warning) FO Foreign object damage

FU Fuel system failure

FW False warning

HA Hail HL Hard landing

JM Jamming

LK Leaks

LL Low level LP Low pressure

LS Lightning strike

MF Multiple failure

OD Odour in cabin

OV Overheat PL Partial power loss

RO Runway overshoot SA Engine slow to accelerate

SD Significant primary structure failure or damage

SE Significant speed exceedance

SF System failure, emergency procedure, abnormal oper SG Engine stall SK Smoke (real or warning) TB Turbulance

UF Uncontained failure

UN Unlocked

VB Vibrations

Appendix – Coding detail

Name Description

AD Adjusted

CL Cleaned

DI De-iced

FD FOD

IN Inspected, checked

LB Lubricated

ME MEL applied

PR Purged, bleed

RK Reracked

RP Replaced

RR Repaired

RS Reset SR Serviced

SW Swapped

TG Tightened, secured

Use Tab 'Insert - Header & Footer' for Presentation Title - Siglum - Reference

Name Description

AD Wrong Adjustment AW Abnormal wear BR Broken

BU Heat Damage

CH Chaffed

CK Crack

CL Clogged

CT Contaminated fluid

DB Debonding

DC Discard

DE Deformation

DI Disconnected

DL Delamination

DT Dirty

ER Erosion

FL Fluid Level Out of limit FN Furnishing Damaged

IC Ice Frozen

IN Fluid Ingress

JM Jammed

LB Lightbulb Burnt LE Leak

LP Pressure Out of limit LT Tension Out of limit MI Missing

NI Nil finding

OT Other PL Protection Damage

RT Corrosion

SC Scratches

SE Seal Damage

SP Separated

SV Incorrect servicing

TF Test Failure

TP TPS Damage

TW Tire Worn

UN Unlocked

UR Unreadable

Finding codes Action codes

Classification errors analyzis

Use Tab 'Insert - Header & Footer' for Presentation Title - Siglum - Reference

From rules to regex patterns + metadescriptions

Understanding of a rule is easier through metadescriptions however rules are suffering from exceptions that require to

redefine them at low level with regular expressions

EX1 (leaking|leaks?|leakage|fluid)(?!.+((ck|chk|check|

(checked|chk'?d?) (normal|ok|good)))) => LK

EX2 (?<!(not|no|eng)[-:\s\w]{1,10})(apu)[-:\s\w]{1,40}(auto(matic)?)? ?s/?(hut)?(-|s |

)?(down?)? => AS

Both metadescription and reg expressions are relevant to support management of the system

e.g. LEAK* NOT NEAR #NEGATION => LK

APU_SHUTDOWN expressions NOT JUST BEFORE NEGATION => AS

Use Tab 'Insert - Header & Footer' for Presentation Title - Siglum - Reference

LAV GALLEY zone

Use Tab 'Insert - Header & Footer' for Presentation Title - Siglum - Reference