Automatická detekce témat

Title Alternative:Automatic detection of topics
dc.contributor.advisorDrábková, Jindra
dc.contributor.authorČerný, Jiří
dc.date2008
dc.date.accessioned2013-12-20
dc.date.available2013-12-20
dc.date.committed2008-05-16
dc.date.defense2008-06-11
dc.date.issued2013-12-20
dc.date.submitted2007-10-31
dc.degree.levelmgrcs
dc.descriptionkatedra: ITE; přílohy: 1 DVD; rozsah: 102cs
dc.description.abstractvyhledání a zhodnocení informací o automatické klasi kaci dokumentů, seznámení s jazykem Perl a balíkem LWP pro potřeby práce s textovými dokumenty, nalezení klasi kátorů v programu WEKA, porovnání různých metod klasi kace a parametrizace textů.cs
dc.description.abstractThe aim of diploma thesis is to find sufficient sequence which can sort out unsigned text documents. It means to prepare a lot of training data for classifier learning. The fruitfulness of classifer is tested by the help of testing data. Newspaper articles from server zpravy.atlas.cz are used as a testing data. The first part of diploma thesis is about automatic detection theory. The second part of diploma thesis is about finding the classifier by the help of program WEKA. Data is processed by the help of programming language Perl and package LWP. Simple text isn't suitable for next processing. For this reason a global dictionary is created. Documents are converted into feature vectors. These vectors can be written by the help of different representation. In diploma thesis different sorts of representation are tested. Program WEKA is used for training classifiers, cluster analysis and select attributes. In this program different representation feature vectors and classifiers algorithms are tested.en
dc.formattext
dc.identifier.urihttps://dspace.tul.cz/handle/15240/644
dc.language.isocs
dc.publisherTechnická Univerzita v Libercics
dc.subjectperlcs
dc.subjectwekacs
dc.subjectautomatická klasifikacecs
dc.subjectklasifikátorcs
dc.subjectpříznakový vektorcs
dc.subjecttřídění dokumentůcs
dc.subjectperlen
dc.subjectwekaen
dc.subjectautomatic classificationen
dc.subjectclassifieren
dc.subjectfeature vectoren
dc.subjectsort out documentsen
dc.subject.verbisautomatické řízenícs
dc.titleAutomatická detekce tématcs
dc.title.alternativeAutomatic detection of topicsen
dc.typeThesis
local.departmentITEcs
local.facultyFakulta mechatroniky, informatiky a mezioborových studiícs
local.identifier.stag14975
local.identifier.verbis353306
local.note.administratorsoprava_A
local.verbis.aktualizace2019-10-05 06:17:46cs
local.verbis.studijniprogramITEcs
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
mgr_14975.pdf
Size:
751.68 KB
Format:
Adobe Portable Document Format
Description:
kvalifikační práce