Given an array of integers citations where citations[i] is the number of citations a researcher received for their ith paper, return the researcher's h-index.

According to the definition of h-index on Wikipedia: The h-index is defined as the maximum value of h such that the given researcher has published at least h papers that have each been cited at least h times.

Example 1:

Input: citations = [3,0,6,1,5]
Output: 3
Explanation: [3,0,6,1,5] means the researcher has 5 papers in total and each of them had received 3, 0, 6, 1, 5 citations respectively. Since the researcher has 3 papers with at least 3 citations each and the remaining two with no more than 3 citations each, their h-index is 3.

Example 2:

Input: citations = [1,3,1]
Output: 1

Constraints:

n == citations.length
1 <= n <= 5000
0 <= citations[i] <= 1000

Approach #1 (Sorting)

Think geometrically. Imagine plotting a histogram where the y-axis represents the number of citations for each paper. After sorting in descending order, the h-index is the length of the largest square in the histogram.

Algorithm:

To find such a square length, we first sort the citations array in descending order. After sorting, if citations[i] > i, then papers 0 to i all have at least i+1 citations. Thus, to find the h-index, we search for the largest i (let's call it i') such that citations[i] > i and therefore the h-index is i'+1.


public class Solution {
      public int hIndex(int[] citations) {
         // sorting the citations in ascending order
         Arrays.sort(citations);
         // finding h-index by linear search
         int i = 0;
         while (i < citations.length && citations[citations.length - 1 - i] > i) {
            i++;
         }
         return i; // after the while loop, i = i' + 1
      }
}

Complexity Analysis

Time complexity : O(nlogn). Comparison sorting dominates the time complexity. Space complexity : O(1). Most libraries using heap sort which costs O(1) extra space in the worst case.

Continue Reading →