什么是 H – 指数？

什么是 H 指数？ “H”代表赫希指数，它是由 JE Hirsch 在 2005 年提出的。h 指数被定义为作者级别的指标，它试图衡量科学家或学者的出版物的生产力和引用影响.

有两个参数需要考虑——

数量- 论文数量
质量——引用次数

基本上，H-index 是最大的数字，因此许多出版物至少有相同数量的引用。作为一个有用的指标来表征研究人员的科学成果。

计算 H 指数 –

例如，假设一位研究人员总共发表了 10 篇论文。

Research Paper	No. of Citations
1	50
2	40
3	33
4	23
5	12
6	11
7	8
8	5
9	1
10	0

H-index is always <= total numbers of papers published

编程需要懂一点英语

为方便起见，我们将引用次数按降序排列。

H-index 不能为 10，因为至少有 10 篇被引用次数在 10 次或超过 10 次的研究论文。相似地，

H指数不能为9，
H指数不能为8，
H-index 为 7，因为有 7 篇研究论文被 7 次或超过 7 次引用。

例子：

Input : Citations = [7, 6, 5, 4, 3]
Output : 4
Explanation : There are 5 papers in total. 
              Since the researcher has 4 papers with at least 4 citations each 
              and the remaining one paper has less than 4 citations. 
              So H-index is 4.

寻找 H 指数的方法：

按升序或降序对引文数组进行排序。
从最低的论文迭代到最高的论文。
剩余论文（结果）是满足 H-index 条件的论文数。

# calculating H-Index
def H_index(citations):
      
    # sorting in ascending order
    citations.sort()
      
    # iterating over the list
    for i, cited in enumerate(citations):
          
        # finding current result
        result = len(citations) - i
          
        # if result is less than or equal
        # to cited then return result
        if result <= cited:
            return result
           
    return 0
  
# creating the citations
citation = [50, 40, 33, 23, 12, 11, 8, 5, 1, 0]
  
# calling the function
print(H_index(citation))

输出

时间复杂度： O(nlogn + n)
空间复杂度： O(1)

H - 指数的局限性：

不同领域的研究人员可以有不同的引用行为。
我们无法比较两位研究人员具有不同的领域和巨大的研究经验差距。与经验较少的研究人员相比，经验丰富的研究人员将具有较高的 H 指数。
H-index 值取决于您使用的数据库，它可能因平台而异。