Indexing software: Difference between revisions

Content deleted Content added
Features: Added standalone and AI approaches to the list of methodologies
Line 5:
== Features ==
There are several methodologies for indexing:<ref name="software"/><ref>{{cite journal | last=Golub | first=Koraljka | title=Automatic subject indexing of text (IEKO) | url=https://www.isko.org/cyclo/automatic | access-date=2022-06-03|date=2019|journal=Knowledge Organization|volume=46|issue=2|pages=104-121}} </ref>
* Standalone indexing applications enable an indexer to create an index as a separate document, later to be integrated into the original text, by manually entering headings and page numbers or other locators. Such applications collate, alphabetize, and sort the raw input to create a formatted index.
* [[Index (publishing)#Embedded indexing|Embedded indexing]] includes the index headings in the midst of the text itself, but surrounded by codes so that they are not normally displayed. A usable index is then generated automatically from the embedded text using the position of the embedded headings to determine the locators. Thus, when the pagination is changed the index can be regenerated with the new locators.
* Tagging allowallows indexing codes to be embedded in the text after the indexing is complete. The indexer inserts numbered dummy tags in the files, and then builds the index separately
* Many [[word processor]]s and [[desktop publishing]] software have integrated automated indexing functions. These tools build a concordance or word lists from processed files. They have often limited usage.
* AI and machine-learning approaches have not yet matured to the point where they can create finished or near-finished indexes.
 
==See also==