Content deleted Content added
Added tags to the page using Page Curation (uncategorised) |
Citation bot (talk | contribs) Add: authors 1-1. Removed URL that duplicated identifier. Removed parameters. Some additions/deletions were parameter name changes. | Use this bot. Report bugs. | Suggested by Abductive | Category:Social networks | #UCB_Category 73/125 |
||
(23 intermediate revisions by 14 users not shown) | |||
Line 1:
{{Short description|Mathematical tool}}
'''Main path analysis''' is a mathematical tool, first proposed by [[Norman P. Hummon|Hummon]] and [[Patrick Doreian|Doreian]] in 1989,<ref name=":0">{{Cite journal|last1=Hummon|first1=Norman P.|last2=Doreian|first2=Patrick|year=1989|title=Connectivity in a citation network: The development of DNA theory|journal=Social Networks|volume=11|issue=1|pages=39–63|doi=10.1016/0378-8733(89)90017-8}}</ref> to identify the major paths in a [[citation network]], which is one form of a [[directed acyclic graph]] (DAG). It has since become an effective technique for mapping technological trajectories, exploring scientific knowledge flows, and conducting literature reviews. [[File:Global key-route main paths for a citation network.svg|thumb|Main path analysis uncovers the most significant paths, or citation chains, in a citation network. The figure shows the global key-route main paths (in red) for a sample citation network (based on search path count and at key-route 1).|220x220px]]
== History ==
Main path analysis is first proposed in Hummon and Doreian (1989)<ref name=":0" /> in which they suggest a different approach for analyzing a citation network "where the connective threads through a network are preserved and the focus is on the links in the network rather than on the nodes."<ref name=":0" /> They call the resulting chain of the most used citation links "main path" and claim that "It is our intuition that the main path, selected on the basis of the most used path will identify the main stream of a literature." The idea was verified using a set of DNA research articles. To make the method more practical, Liu and Lu (2012)<ref name=":1">{{Cite journal|
== The method ==
Main path analysis
=== Preparing a citation network ===
It is necessary to prepare a [[citation network]] before starting main path analysis. In a citation network, the nodes represent the documents such as academic articles, patents, or legal cases. These nodes are connected using citation information. Citation networks are by nature directed because the two nodes on the opposite end of a link are not symmetrical in their roles. As regards to the direction, this article adopts the convention that the cited node points to the citing node, signifying the fact that knowledge in the cited node flows to the citing node. Citation network is also by nature acyclic, which means that a node can never chain back to itself if one moves along the links following their direction.
Several terms related to a citation network are defined here before proceeding further. Heads are the nodes the direction arrow leads to. Tails are the nodes on other ends of the direction arrow. Sources are the nodes that are cited but cite no others.
[[File:SPC values for a citation network.png|thumb|Figure 1. SPC values for a sample citation network]]
=== Traversal counts ===
Traversal counts measure the significance of a link. The literature discusses several types of traversal counts, including search path count (SPC), search path link count (SPLC), search path node pair (SPNP), and other variations.<ref name=":5" />
[[File:SPLC values for a citation network.png|thumb|Figure 2. SPLC values for a sample citation network]]
==== Search path count (SPC) ====
A link’s SPC is the number of times the link is traversed if one runs through all possible paths from all the sources to all the sinks. SPC is first proposed by [[Vladimir Batagelj]].<ref>Batagelj, V. (2003). Efficient algorithms for citation network analysis.
[[File:SPNP values for a citation network.png|thumb|Figure 3. SPNP values for a sample citation network]]
==== Search path link count (SPLC) ====
A link’s SPLC is the number of times the link is traversed if one runs through all possible paths from all the ancestors of the tail node (including itself) to all the sinks.
==== Search path node pair (SPNP) ====
A link’s SPNP is the number of times the link is traversed if one runs through all possible paths from all the ancestors of the tail node (including itself) to all the descendants of the head node (including itself). SPNP is first proposed by Hummon and Doreian.<ref name=":0" />
[[File:Local main paths SPC.png|thumb|Figure 4. Local main paths in a sample citation network]]
Line 43 ⟶ 44:
==== Key-route search ====
Key-route search is designed to avoid the problem of missing significant links in both the local and global search. The problem is in the local and global main paths shown above, in which one of the most important links (H, K) is not included in the main paths. As described in Liu and Lu (2012),<ref name=":1" />
== The Variants ==
In addition to the key-route search approach, variations of the method include the approach that is aggregative and stochastic,<ref>{{Cite journal|
== Applications ==
Line 52 ⟶ 53:
=== Academic article ===
Academic citation databases such as [[Web of Science]] and [[Scopus]] include comprehensive digitized citation information. These information make it possible to apply main path analysis to examine the knowledge structure or trace the knowledge flow of any scientific fields. Some early applications explores the subject of centrality-productivity,<ref>{{Cite journal|
=== Patent ===
Patents referencing prior arts is a common practice. For example, each United States patent document includes a "References Cited" section that lists the prior arts of the patent. Patent databases such as [[Clarivate Analytics]] and Webpat provide digitized patent citation information. Verspagen (2007)<ref
=== Judicial document ===
Line 71 ⟶ 72:
After traversal counts are computed, the following command sequences find the main paths.
For local main paths
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Local Search → Forward</small>''
Line 86 ⟶ 87:
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Global Search → Key-Route</small>''
In addition to key-route search, a more flexible search feature is added starting from Pajek version 5.03 (January 4, 2018). The new feature allows for local and global search passing through vertices defined by a cluster. The command sequences are as follows:
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Local Search → Key-Route → Through Vertices in Cluster</small>''
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Global Search → Key-Route → Through Vertices in Cluster</small>''
== References ==
Line 91 ⟶ 98:
== External links ==
* [http://mrvar.fdv.uni-lj.si/pajek/ Pajek], a free social network analysis software.
* [https://davincierlab.weebly.com/list-of-main-path-articles.html List of main path articles], this page contain a list of academic articles that introduce, explain, apply, modify, or extend the method originated in Hummon and Doreian.
__FORCETOC__
[[Category:Social networks]]
|