Published in:

Infoteka, 2002, vol. 3, no. 1-2, pp. 13-21     

Scriptor: Bibliographic Information Parsing Program

Dejan Pajić

Department of Psychology,
Faculty of Philosophy in Novi Sad

Pero Šipka

Department of Psychology,
Faculty of Philosophy in Novi Sad

Biljana Kosanović

National Library of Serbia, Belgrade

Abstract: This paper describes Scriptor, a program developed for maintaining SocioFakt and designed to parse journal contents and references. Using auxiliary databases (e.g. lists of author and publisher names) and simple algorithms for processing Serbian as a natural language, the program recognizes the elements of journal contents and article references (e.g. author name, book title, journal title, page numbers) and assigns a standardized label to each element, enabling automatic transfer of the information into the corresponding database field.
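The labeling idea can be illustrated with a minimal sketch. It is written in Python rather than Scriptor's own Visual Basic for Applications, and it is not the program's actual algorithm: the function name `label_elements`, the tiny `KNOWN_AUTHORS` list, the label names, and the reference layout are all assumptions made for illustration.

```python
import re

# Hypothetical stand-in for an auxiliary database of author surnames.
KNOWN_AUTHORS = {"Pajić", "Šipka", "Kosanović"}


def label_elements(reference: str) -> dict:
    """Assign simple labels (authors, year, pages) to parts of a reference string."""
    labels = {}

    # Authors: any word that appears in the auxiliary surname list.
    words = re.findall(r"\w+", reference)
    authors = [word for word in words if word in KNOWN_AUTHORS]
    if authors:
        labels["authors"] = authors

    # Year: a four-digit number starting with 19 or 20.
    year = re.search(r"\b(?:19|20)\d{2}\b", reference)
    if year:
        labels["year"] = year.group(0)

    # Pages: a numeric range introduced by "pp." (assumed notation).
    pages = re.search(r"pp\.\s*(\d+)\s*-\s*(\d+)", reference)
    if pages:
        labels["pages"] = f"{pages.group(1)}-{pages.group(2)}"

    return labels


result = label_elements(
    "Pajić, D., Šipka, P. (2002). Scriptor. Infoteka, 3, pp. 13-21."
)
```

Each recognized element ends up under a named key, which is the property that lets a labeled element be copied into a matching database field.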

Apart from the basic parsing module, the program provides subroutines for converting between character sets, (de)capitalizing words according to orthographic rules, inverting the order of an author's first name and surname, filling in missing data, and interactively checking and correcting the parsed information.
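As an example of one such subroutine, name inversion could look roughly like the following. This is a hedged Python sketch, not Scriptor's code: the function name `invert_name` and the rule that the last word is the surname are assumptions, and the rule would not handle multi-word surnames.

```python
def invert_name(full_name: str) -> str:
    """Turn 'First Surname' into 'Surname, First' (assumes the last word is the surname)."""
    parts = full_name.strip().rsplit(" ", 1)
    if len(parts) < 2:
        # A single-word name has nothing to invert.
        return full_name.strip()
    first_names, surname = parts
    return f"{surname}, {first_names}"


inverted = invert_name("Dejan Pajić")
```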

Scriptor comes with an installation program and a detailed help file, which contains instructions for operators on using the program effectively and defines the bibliographic standards used in maintaining SocioFakt. Scriptor is written in Visual Basic for Applications as a Microsoft Word template.

Key words: bibliographic information, parsing, bibliographic databases, citation information, software

Full text (in Serbian)
