İngilizcede çember dolgu oranı yöntemi ile optik karakter tanıma

Tilki, Sahra

İngilizcede çember dolgu oranı yöntemi ile optik karakter tanıma

dc.contributor.advisor	Şirin, Yahya
dc.contributor.author	Tilki, Sahra
dc.contributor.author	Tilki, Sahra
dc.date.accessioned	2022-03-10T13:40:36Z
dc.date.available	2022-03-10T13:40:36Z
dc.date.issued	2020
dc.department	İstanbul Sabahattin Zaim Üniversitesi, Lisansüstü Eğitim Enstitüsü, Bilgisayar Bilimleri ve Mühendisliği Ana Bilim Dalı	en_US
dc.description.abstract	Çeşitli alanlarda, mevcut olan verilerden bilgisayar ortamına bilgi depolaması yapabilmek, el yazısıyla yazılmış veya basılı dokümanlarda bulunan bilgileri, bilgisayarlar aracılığıyla tekrar kullanabilmek için fazla talep bulunmaktadır. Ancak bu bilgilerin olduğu belge veya dosyaları yeniden kullanabilmek, saklayabilmek için, bilgilerin okunması ve sorgulanması gerekmektedir. Bu nedenler doğrultusunda görüntü dosyalarından, özellikle de metinlerdeki bilgileri otomatik olarak alma ve depolama tekniklerine ihtiyaç vardır. Optik Karakter Tanıma (OCR), görüntülerden otomatik olarak metin çıkarma ve işleme yeteneğine sahip bir bilgisayar sistemi geliştirmeye çalışan, son yıllarda araştırmacıların ilgisini çeken aktif bir araştırma alanıdır. OCR' nın amacı, herhangi bir metin biçiminde veya el yazısı içeren belgelerde değişiklik veya dönüşüm elde etmektir. Bu sayede daha derin ve daha ileri işlemler için yazdırılabilir veya taranmış metin görüntüleri, dijital formatta düzenlenebilir ve saklanabilir. Görüntüdeki karakterlerin doğru tanınması için görüntü üzerinde var olan çeşitli bozukluklar, karakterlerin font özellikleri, ışık miktarı, gibi birçok etkenin düzenlenmesi gerekmektedir. Bu nedenle kullanılacak görüntüler üzerinde gürültü azaltma, rotasyon düzeltme, ikili hale getirme ve normalizasyon gibi çeşitli görüntü işleme teknikleri uygulamak gerekir. Yapılan bu işlemler, sonraki aşamalarda karşılaşılabilecek zorluklara çözüm olup, doğru tanıma yapabilmek açısından oldukça önemli bir yere sahiptir. Bu çalışmada, iskelet temsilini baz alan bir yöntem olan Çember Dolgu Oranı yönteminden elde edilen bilgiler ile derin öğrenme tekniklerinden biri olan CNN(Konvolüsyonel Sinir Ağı) modelinin birlikte kullanılmasının sonucunda daha az veri ile tanıma hızının artırılması ve yüksek doğruluk oranında tanıma amaçlanmaktadır. Çalışmada kullanılan görüntüler, Matlab platformunda gerekli görüntü işleme teknikleri uygulanarak oluşturulan CNN modelinde eğitim için kullanılmak üzere hazır hale getirilmiştir. İlk aşamada CNN modelinde girdi olarak karakter görüntüleri kullanılmıştır. Daha sonra iskelet temsilleri oluşturulan görüntülerin, iskelet noktalarıyla ve Çember Dolgu Oranı yöntemi ile elde edilen noktalara PCA (Temel Bileşenler Analizi) işlemi uygulanarak görsel biçimde kullanılması sağlanmış ve bu işlemle birlikte yeni görüntüler elde edilmiştir. Bu görüntüler aynı CNN modelinde kullanılıp eğitim yapılmıştır. Gerçekleştirilen eğitimler sonucunda ulaşılan test sonuçlarının doğruluk oranları arasında performans kıyaslamaları yapılmış ve deneysel çalışmalarla sonuçları sunulmuştur.	en_US
dc.description.abstract	There is a lot of demand to be able to store information from the data available in various fields to the computer environment, to reuse information which is handwritten or printed documents via computers. However, in order to reuse and store the documents or files containing this information, the information must be read and questioned. For these reasons, there is a need for automatic retrieval and storage of information from image files, especially text. Optical Character Recognition (OCR) is an active research area that has been attracting researchers in recent years, trying to develop a computer system capable of automatically extracting and processing text from images. The purpose of OCR is to achieve changes or conversions in any text format or documents containing handwriting. Thus, printed or scanned text images can be edited and stored in digital format for deeper and further processing. In order for the characters in the image to be recognized correctly, various factors such as various defects on the image, the font properties of the characters, and the amount of light should be regulated. For this reason, it is necessary to apply various image processing techniques such as noise reduction, rotation correction, binarization and normalization on the images to be used. These procedures are a solution to the difficulties that may be encountered in the next stages and have a very important place in terms of making accurate recognition. In this study, it is aimed to increase the recognition speed and recognition with less data as a result of using the information obtained from the Circle Fill Rate method, which is a method based on skeletal representation, and CNN (Convolutional Neural Network) which is one of the deep learning techniques, together with less data.The images used in the study are made ready to be used for training in the CNN model created by applying the necessary image processing techniques in Matlab platform. In the first stage, character images were used as inputs in the CNN model. Later, the images created with skeletal representations were used visually by applying PCA (Principle Components Analysis) to the points obtained with the skeleton points and the Circle Filling Rate method, and new images were obtained with this process. These images were used in the same CNN model and trained. Performance comparisons were made between the accuracy rates of the test results obtained as a result of the trainings conducted and the results were presented with experimental studies.	en_US
dc.identifier.citation	Tilki, S. (2020). İngilizcede çember dolgu oranı yöntemi ile optik karakter tanıma. (Yüksek Lisans Tezi). İstanbul Sabahattin Zaim Üniversitesi Lisansüstü Eğitim Enstitüsü.
dc.identifier.endpage	99	en_US
dc.identifier.startpage	1	en_US
dc.identifier.uri	https://tez.yok.gov.tr/UlusalTezMerkezi/TezGoster?key=v7BkNnnepTnbhn8rNR77LYxCeXCjuZP_aty5agJcrbzMdF3NPdF_-zlNEtgIBlfw
dc.identifier.uri	https://hdl.handle.net/20.500.12436/3476
dc.identifier.yoktezid	689747	en_US
dc.institutionauthor	Tilki, Sahra
dc.language.iso	tr
dc.publisher	İstanbul Sabahattin Zaim Üniversitesi	en_US
dc.relation.publicationcategory	Tez	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	en_US
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.subject	OCR, Görüntü İşleme, Nesne Tanıma, Derin Öğrenme, CNN.	en_US
dc.subject	OCR, Image Processing, Object Recognition, Deep Learning, CNN.	en_US
dc.title	İngilizcede çember dolgu oranı yöntemi ile optik karakter tanıma	en_US
dc.title.alternative	Optical character recognition with circle filling rate method for English	en_US
dc.type	Master Thesis
dspace.entity.type	Publication
relation.isAuthorOfPublication	4b84ec9f-70a0-43dd-b9db-561485fcbff1
relation.isAuthorOfPublication.latestForDiscovery	4b84ec9f-70a0-43dd-b9db-561485fcbff1

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1

İsim:: 689747.pdf
Boyut:: 2.03 MB
Biçim:: Adobe Portable Document Format
Açıklama:: Tez dosyası / Thesis file

İndir

Koleksiyon

Bilgisayar Bilimleri ve Mühendisliği