algospot OCR

알고리즘/algospot

algospot OCR

꾸준함. 2018. 2. 1. 17:02

문제 링크입니다:https://algospot.com/judge/problem/read/OCR

여태까지 봤던 문제들 중에서 개인적으로 제일 어려운 문제라고 생각합니다.

Viterbi algorithm(비터비 알고리즘)을 이용하여 푸는 문제인데 문제해설을 세번이나 정독했는데도 아직 부족하다는 느낌이 듭니다.

다음 문제를 풀기 전에 Viterbi algorithm을 충분히 익혀야겠습니다...

OCR은 사람이 쓰거나 기계로 인쇄한 글자를 스캔한 이미지를 다시 기계가 읽을 수 있는 문자로 변환하는 과정이다.

과거에 인식했던 수많은 문장들을 분석해 원본 문장의 형태를 파악하려고 한다

//중략

어떤 문장을 단어별로 인식한 결과가 주어졌을 때,

원본일 조건부확률이 가장 높은 ㅁ누장을 찾아내는 프로그램을 작성하시오

#include <iostream>

#include <string>

#include <cmath> //log

using namespace std;

//문장을 구성하는 단어의 개수, 단어의 수, 처리할 문장의 수

int composition, wordNum, sentenceNum;

//분류기가 반환한 문장, 단어 번호로 변환되어 있음

int classified[100];

//Begin[i] i번째 단어가 첫 단어로 나올 확률

double Begin[501];

//rightAfter[i][j]=i 단어 이후에 j 단어가 나올 확률의 로그값

double rightAfter[501][501];

//missMatched[i][j]=i 단어가 j 단어로 분류될 확률의 로그값

double missMatched[501][501];

int choice[102][502];

double cache[102][502]; //1로 초기화

string word[501]; //입력받은 단어들의 목록

//Q[segment] 이후를 채워서 얻을 수 있는 최대 g() 곱의 로그값 반환

//Q[segment-1]==previousMatch라 가정

double recognize(int segment, int previousMatch)

{

if (segment == composition)

return 0;

double &result = cache[segment][previousMatch];

if (result != 1.0)

return result;

result = -1e200; //log(0)=음의 무한대

int &choose = choice[segment][previousMatch];

//classified[segment]에 대응되는 단어

for (int thisMatch = 1; thisMatch <= wordNum; thisMatch++)

{

//문장 R이 주어질 때 조건부확률 P(Q|R)을 최대화하는 원문 Q를 찾는 알고리즘

//P(Q|R)=(P(R|Q)*P(Q))/P(R) => 여기서 P(Q|R)이란 P(Q∩R)/P(R)

//P(R|Q)는 원문이 Q일 때 분류기가 R을 반환할 확률: 따라서 P(R|Q)=∏[M(Q[i], R[i])] => 여기서 M(a, b)=단어 a를 b로 분류할 확률

//P(Q)=Begin(Q[0])*∏[rightAfter(Q[i-1], Q[i])]

//f(Q)=P(R|Q)*P(Q)=∏[missMatched(Q[i], R[i])*rightAfter(Q[i-1], Q[i])]=∏g(Q[i])

//g(thisMatch)=rightAfter(previousMatch, thisMatch)*missMatched(thisMatch, R[segment])

double candidate = rightAfter[previousMatch][thisMatch] + missMatched[thisMatch][classified[segment]] + recognize(segment + 1, thisMatch);

if (result < candidate)

{

result = candidate;

choose = thisMatch;

}

return result;

}

//광학 문자 인식 문제의 실제 답 계산하기

string reconstruct(int segment, int previousMatch)

{

int choose = choice[segment][previousMatch];

string result = word[choose];

if (segment < composition - 1)

result = result + " " + reconstruct(segment + 1, choose);

return result;

}

//캐시 초기화

void initialize()

{

for (int i = 0; i < composition; i++)

for (int j = 0; j <= wordNum; j++)

cache[i][j] = 1.0;

}

int main(void)

{

cin >> wordNum >> sentenceNum;

if (wordNum < 1 || wordNum>500 || sentenceNum < 1 || sentenceNum>100)

exit(-1);

for (int i = 1; i <= wordNum; i++)

cin >> word[i];

for (int i = 1; i <= wordNum; i++)

{

cin >> Begin[i];

Begin[i] = log(Begin[i]); //log로 변환

}

for (int i = 0; i <= wordNum; i++)

for (int j = 1; j <= wordNum; j++)

{

//책의 트릭을 이용하여 시작단어를 [0][j] 인덱스에 저장

//즉, Q[0]가 항상 시작단어라고 지정

//그렇게 하면 P(Q)=∏(rightAfter(Q[i-1], Q[i])) => Begin(Q[0])*∏(rightAfter(Q[i-1], Q[i]))보다 간단해졌다

if (i == 0)

rightAfter[i][j] = Begin[j];

else

{

cin >> rightAfter[i][j];

rightAfter[i][j] = log(rightAfter[i][j]);

}

for(int i=1; i<= wordNum; i++)

for (int j = 1; j <= wordNum; j++)

{

cin >> missMatched[i][j];

missMatched[i][j] = log(missMatched[i][j]);

}

for (int i = 0; i < sentenceNum; i++)

{

cin >> composition;

if (composition < 1 || composition>100)

exit(-1);

initialize();

for (int i = 0; i < composition; i++)

{

string input;

cin >> input;

for (int j = 1; j <= wordNum; j++)

if (input == word[j])

{

classified[i] = j;

break;

}

recognize(0, 0);

cout << reconstruct(0, 0) << endl;

}

return 0;

}

개발환경:Visual Studio 2017

지적, 조언, 질문 환영입니다! 댓글 남겨주세요~

[참고] 프로그래밍 대회에서 배우는 알고리즘 문제해결전략

저작자표시 비영리 (새창열림)

'알고리즘 > algospot' 카테고리의 다른 글

algospot KLIS (0)	2018.02.02
algospot MORSE (2)	2018.02.01
algospot PACKING (0)	2018.02.01
최대 부분 수열 직접 구하기(LIS) (3)	2018.01.31
algospot NUMB3RS (0)	2018.01.29

현재글algospot OCR

꾸준함

algospot OCR

'알고리즘 > algospot' 카테고리의 다른 글

'알고리즘/algospot'의 다른글

티스토리툴바

algospot OCR

'알고리즘 > algospot' 카테고리의 다른 글

'알고리즘/algospot'의 다른글

관련글

티스토리툴바