Content deleted Content added
Johnliu.tw (talk | contribs) m Added more contents. |
Citation bot (talk | contribs) Add: authors 1-1. Removed URL that duplicated identifier. Removed parameters. Some additions/deletions were parameter name changes. | Use this bot. Report bugs. | Suggested by Abductive | Category:Social networks | #UCB_Category 73/125 |
||
(43 intermediate revisions by 16 users not shown) | |||
Line 1:
{{Short description|Mathematical tool}}
Main path analysis was first proposed by Hummon and Doreian<ref name=":0">{{Cite journal|last=Hummon|first=Norman P.|last2=Doreian|first2=Patrick|date=|title=Connectivity in a citation network: The development of DNA theory|url=https://doi.org/10.1016/0378-8733(89)90017-8|journal=Social Networks|volume=11|issue=1|pages=39–63|doi=10.1016/0378-8733(89)90017-8|via=}}</ref>. It is a mathematical tool to identify the major paths in a [[directed acyclic graph]] (DAG), typically [[citation network]]. The method begins by measuring the significance of all the links in a citation network through the concept of ‘traversal count’ and then sequentially chains the most significant links into a ‘main path’, which is deemed the most significant historical path in the target DAG. The method is applicable to any human activity that can be organized in the form of a DAG. The most common applications are tracing the knowledge flow paths or development trajectories of a science or technology field through bibliographic citations or patent citations<ref name=":2">{{Cite journal|last=Liu|first=John S.|last2=Lu|first2=Louis Y.Y.|last3=Lu|first3=Wen-Min|last4=Lin|first4=Bruce J.Y.|title=Data envelopment analysis 1978–2010: A citation-based literature survey|url=https://doi.org/10.1016/j.omega.2010.12.006|journal=Omega|volume=41|issue=1|pages=3–15|doi=10.1016/j.omega.2010.12.006}}</ref><ref>{{Cite journal|last=Verspagen|first=Bart|date=2007-03-01|title=Mapping technological trajectories as patent citation networks: a study on the history of fuel cell research|url=http://www.worldscientific.com/doi/abs/10.1142/S0219525907000945|journal=Advances in Complex Systems|volume=10|issue=01|pages=93–115|doi=10.1142/S0219525907000945|issn=0219-5259}}</ref><ref name=":3">{{Cite journal|last=Lucio-Arias|first=Diana|last2=Leydesdorff|first2=Loet|date=2008-10-01|title=Main-path analysis and path-dependent transitions in HistCite™-based historiograms|url=http://onlinelibrary.wiley.com/doi/10.1002/asi.20903/abstract|journal=Journal of the American Society for Information Science and Technology|language=en|volume=59|issue=12|pages=1948–1962|doi=10.1002/asi.20903|issn=1532-2890}}</ref>. It has also been applied to judicial decision to trace the evolving changes of legal opinion<ref>{{Cite journal|last=Liu|first=John S.|last2=Chen|first2=Hsiao-Hui|last3=Ho|first3=Mei Hsiu-Ching|last4=Li|first4=Yu-Chen|date=2014-12-01|title=Citations with different levels of relevancy: Tracing the main paths of legal opinions|url=http://onlinelibrary.wiley.com/doi/10.1002/asi.23135/abstract|journal=Journal of the Association for Information Science and Technology|language=en|volume=65|issue=12|pages=2479–2488|doi=10.1002/asi.23135|issn=2330-1643}}</ref>.▼
'''Main path analysis''' is a mathematical tool, first proposed by [[Norman P. Hummon|Hummon]] and [[Patrick Doreian|Doreian]] in 1989,<ref name=":0">{{Cite journal|last1=Hummon|first1=Norman P.|last2=Doreian|first2=Patrick|year=1989|title=Connectivity in a citation network: The development of DNA theory|journal=Social Networks|volume=11|issue=1|pages=39–63|doi=10.1016/0378-8733(89)90017-8}}</ref> to identify the major paths in a [[citation network]], which is one form of a [[directed acyclic graph]] (DAG). It has since become an effective technique for mapping technological trajectories, exploring scientific knowledge flows, and conducting literature reviews. [[File:Global key-route main paths for a citation network.svg|thumb|Main path analysis uncovers the most significant paths, or citation chains, in a citation network. The figure shows the global key-route main paths (in red) for a sample citation network (based on search path count and at key-route 1).|220x220px]]
▲
== History ==
Main path analysis is first
== The method ==
Main path analysis
=== Preparing a citation network ===
It is necessary to prepare a [[citation network]] before starting main path analysis. In a citation network, the nodes
[[File:SPC values for a citation network.png|thumb|Figure 1. SPC values for a sample citation network]]
=== Traversal counts ===
Traversal counts
[[File:SPLC values for a citation network.png|thumb|Figure 2. SPLC values for a sample citation network]]
==== Search path count (SPC) ====
A link’s SPC is the number of times the link is traversed if one runs through all possible paths from all the sources to all the sinks. SPC is first proposed by [[Vladimir Batagelj]].<ref>Batagelj, V. (2003). Efficient algorithms for citation network analysis.
[[File:SPNP values for a citation network.png|thumb|Figure 3. SPNP values for a sample citation network]]
▲A link’s SPC is the number of times the link is traversed if one runs through all possible paths from all the sources to all the sinks. SPC is first proposed by Vladimir Batagelj<ref>Batagelj, V. (2003). Efficient algorithms for citation network analysis. ''arXiv preprint cs/0309023''.</ref>.
A link’s SPLC is the number of times the link is traversed if one runs through all possible paths from all the ancestors of the tail node (including itself) to all the sinks.
==== Search path
A link’s SPNP is the number of times the link is traversed if one runs through all possible paths from all the ancestors of the tail node (including itself) to all the descendants of the head node (including itself). SPNP is first proposed by Hummon and Doreian.<ref name=":0" /> The SPNP values of the link (C, H) is 6 because there are 6 paths that begin from A, B, C (A and B are C's ancestors) and end at H and K (K is H's descendant). These paths are A-C-H, A-C-H-K, B-C-H, B-C-H-K, C-H, and C-H-K.▼
[[File:Local main paths SPC.png|thumb|Figure 4. Local main paths in a sample citation network]]
▲A link’s SPLC is the number of times the link is traversed if one runs through all possible paths from all the ancestors of the tail node (including itself) to all the sinks. SPLC is first proposed by Hummon and Doreian<ref name=":0" />.
▲Search path node pair (SPNP)
▲A link’s SPNP is the number of times the link is traversed if one runs through all possible paths from all the ancestors of the tail node (including itself) to all the descendants of the head node (including itself). SPNP is first proposed by Hummon and Doreian<ref name=":0" />.
=== Path search ===
Based on the traversal counts, one can then search for the most significant path(s). There are several ways of finding them, including local
[[File:Global main paths SPC.png|thumb|Figure 5. Global main paths in a sample citation network]]
==== Local search ====
[[File:Local key-route main paths SPC.png|thumb|Figure 6. Local key-route main paths in a sample citation network]]
==== Global search ====
Global search simply
[[File:Global key-route main paths SPC.png|thumb|Figure 7. Global key-route main paths in a sample citation network]]
==== Key-route search ====
Key-route search is designed to avoid the problem of missing significant links in both the local and global search. The problem is in the local and global main paths shown above, in which one of the most important links (H, K) is not included in the main paths. As described in Liu and Lu (2012),<ref name=":1" />
== The Variants ==
In addition to the key-route search approach, variations of the method include
== Applications ==
The method has been applied to three types of documenting system that
=== Academic article ===
Academic citation databases such as [[Web of Science]] and [[Scopus]]
=== Patent ===
Patents referencing prior arts is
===
In the common law system, a court decision document usually references previously published opinions for the purpose of justifying the current decision. These judicial references, or legal citations, can also be used to construct citation networks and then tracing the changes of legal opinions. Research opportunity in this area is wide open. Liu et al. (2014)<ref name=":4" /> conducted an exploratory study on such type of application.
== Software Implementation ==
Main path analysis is implemented in [http://mrvar.fdv.uni-lj.si/pajek/ Pajek], a widely used
''<small>Network → Acyclic Network → Create Weighted Network + Vector → Traversal Weights → Search Path Link Count (SPC), or</small>''
Line 68 ⟶ 72:
After traversal counts are computed, the following command sequences find the main paths.
For local main paths
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Local Search → Forward</small>''
Line 83 ⟶ 87:
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Global Search → Key-Route</small>''
In addition to key-route search, a more flexible search feature is added starting from Pajek version 5.03 (January 4, 2018). The new feature allows for local and global search passing through vertices defined by a cluster. The command sequences are as follows:
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Local Search → Key-Route → Through Vertices in Cluster</small>''
''<small>Network → Acyclic Network → Create (Sub)Network → Main Paths → Global Search → Key-Route → Through Vertices in Cluster</small>''
== References ==
Line 88 ⟶ 98:
== External links ==
* [http://mrvar.fdv.uni-lj.si/pajek/ Pajek], a free social network analysis software.
* [https://davincierlab.weebly.com/list-of-main-path-articles.html List of main path articles], this page contain a list of academic articles that introduce, explain, apply, modify, or extend the method originated in Hummon and Doreian.
__FORCETOC__
[[Category:Social networks]]
|