Coding Interview Questions: No. 13 - First Character Appearing Only Once

Monday, October 24, 2011

No. 13 - First Character Appearing Only Once

Problem: Implement a function to find the first character in a string which only appears once.

For example: It returns ‘b’ when the input is “abaccdeff”.

Analysis: Our native solution for this problem may be scanning the input string from its beginning to end. We compare the current scanned character with every one behind it. If there is no duplication after it, it is a character appearing once. Since it compares each character with O(n) ones behind it, the overall time complexity is O(n²) if there are n characters in a string.

In order to get the numbers of occurrence times of each character in a string, a data container is needed. It is required to get and update the occurrence time of each character in a string, so the data container is used to project a character to a number. Hash tables fulfill this kind of requirement. We can implement a hash table, in which keys are characters and values are their occurrence times in a string.

It is necessary to scan strings twice: When a character is visited, we increase the corresponding occurrence time in the hash table during the first scanning. In second round of scanning, whenever a character is visited we also check its occurrence time in the hash table. The first character with occurrence time 1 is the required output.

Hash tables are complex, and they are not implemented in the C++ standard template library. Therefore, we have to implement one by ourselves.

Characters have 8 bits, so there only 256 variances. We can create an array with 255 numbers, in which indexes are ASCII values of all characters, and numbers are their occurrence times in a string. That is to say, we have a hash table whose size if 256, with ASCII values of characters as keys.

It is time for programming after we get a clear solution. The following are some sample code:

char FirstNotRepeatingChar(char* pString)

{

if(pString == NULL)

return '\0';

const int tableSize = 256;

unsigned int hashTable[tableSize];

for(unsigned int i = 0; i<tableSize; ++ i)

hashTable[i] = 0;

char* pHashKey = pString;

while(*(pHashKey) != '\0')

hashTable[*(pHashKey++)] ++;

pHashKey = pString;

while(*pHashKey != '\0')

{

if(hashTable[*pHashKey] == 1)

return *pHashKey;

pHashKey++;

}

return '\0';

}

In the code above, it costs O(1) time to increase the occurrence time for each character. The time complexity for the first scanning is O(n) if the length of string is n. It takes O(1) time to get the occurrence time for each character, so it costs O(n) time for the second scanning. Therefore, the overall time it costs is O(n).

In the meantime, an array with 256 numbers is created, whose size is 1K. Since the size of array is constant, the space complexity of this algorithm is O(1).

The discussion about this problem is included in my book <Coding Interviews: Questions, Analysis & Solutions>, with some revisions. You may find the details of this book on Amazon.com, or Apress.

The author Harry He owns all the rights of this post. If you are going to use part of or the whole of this ariticle in your blog or webpages, please add a referenced to http://codercareer.blogspot.com/. If you are going to use it in your books, please contact me (zhedahht@gmail.com) . Thanks.

24 comments:

volkanJanuary 5, 2012 at 11:07 PM
how about using XOR?
ReplyDelete
Replies
David NeissMarch 27, 2013 at 11:24 PM
Re "Hash tables are complex, and they are not implemented in the C++ standard template library"

Cxx11 updated STL to include unordered associative maps (hashmaps), http://en.wikipedia.org/wiki/C%2B%2B11#Hash_tables
ReplyDelete
Replies
Harry HeMarch 28, 2013 at 10:14 PM
Thanks for your reminding
ReplyDelete
Replies
cosmos2020May 21, 2013 at 3:17 AM
Instead of taking an array of size 256 we can take an array of size 26(no. of english alphabet) + 1 (for end of string). We then index the array with the ascii value of the character and increment the array value held at that location if the corresonding alphabet is processed in the string given. With this we can find the number of repeating characters along with the first non-repeating character...
ReplyDelete
Replies
Liber WangAugust 28, 2013 at 6:17 AM
The question is "First Character". So, if string is 'abc123ac23', the answer should be 'b',right? Does your code output 'b' or '1'? Thanks,
ReplyDelete
Replies
AnonymousJanuary 14, 2014 at 3:29 AM
Instead of scanning twice you could just scan from end of string to beginning. As you scan, keep track of the most recent character for which you increment the appropriate hash value from 0 to 1. At the end of the scan, you will have the desired character.

This has the same time/space requirements, as Harry's solution above, but is maybe a little bit "cuter".
ReplyDelete
Replies
A. Thomas TranJune 6, 2014 at 7:39 AM
What if we have an Input String of 1 terabyte? I was thinking this: Instead of iterating over Input String again for all characters, enqueue new characters during the 1st loop. Then iterating through Queue would be constant time O(1) because Queue can only have 256 unique characters max. This gets us O(n+k) instead of O(n+n), it would only make sense if n.size() > 256 and vastly huge.
ReplyDelete
Replies
UnknownSeptember 2, 2014 at 7:56 PM
check this also for c# interview questions

@ http://skillgun.com
ReplyDelete
Replies
DeadlockAugust 28, 2016 at 3:48 PM
This comment has been removed by the author.
ReplyDelete
Replies
DeadlockAugust 28, 2016 at 3:52 PM
This comment has been removed by the author.
ReplyDelete
Replies
DeadlockAugust 28, 2016 at 3:53 PM
This comment has been removed by the author.
ReplyDelete
Replies
DeadlockAugust 28, 2016 at 3:54 PM
void Hashing::PrintFirstCharacterAppearingOnlyOnce(string str)
{
int count1 =0;
for(int i=0;ic<<endl;;
break;
}
count1 =0;
}
}

see full code on below link..

void Hashing::PrintFirstCharacterAppearingOnlyOnce(string str)
{
int count1 =0;
for(int i=0;ic<<endl;;
break;
}
count1 =0;
}
}

see full code on below link..

https://www.facebook.com/1411961805712287/photos/a.1754631978111933.1073741828.1411961805712287/1774229756152155/?type=3
ReplyDelete
Replies
AnonymousDecember 14, 2016 at 3:29 PM
Hi Harry, thanks for this - great stuff!

A good one here too that has to do with hash tables:

hash table interview question
ReplyDelete
Replies
UnknownFebruary 24, 2017 at 9:12 AM
import java.util.LinkedHashMap;

// abaccdeff
public class FirstNonRepeating {
public static void main(String[] args){
String str = "abaccdeff";
LinkedHashMap lMap = new LinkedHashMap();
for(int i=0;i<str.length();i++){
if(!lMap.containsKey(str.charAt(i))){
lMap.put(str.charAt(i), 0);
}else{
char temp = str.charAt(i);
int count = lMap.get(str.charAt(i)) + 1;
lMap.remove(temp);
lMap.put(temp, count);
}
}

Character key = lMap.keySet().iterator().next();
int value = lMap.get(key);
if(value==0){
System.out.println(" Fisrt non repeating character is - "+key);
}else{
System.out.println(" No non repeating character ");
}
}
}

I am trying to evaluate this Java solution. I used a LinkedHashMap to maintain the insert order which has a cost but I think the operations are still O(1). Please suggest.
ReplyDelete
Replies
UnknownJune 6, 2018 at 12:18 PM
Sain Bainuu,

Your writing shines like gold! There is no room for gibberish here clearly. You nailed it in No. 13 - First Character Appearing Only Once

I will have the mp3 files my customer buys on a wordpress page and a cart will direct them to that page. AWS Training If I want the mp3 files to be downloaded by the customer is there any reason to protect them except to keep them from being indexed by a search engine? Do I need to have a key or do a get operation other than have server-side encryption in S3?

Very useful article, if I run into challenges along the way, I will share them here.

Grazie,
Irene Hynes
ReplyDelete
Replies
UnknownJune 9, 2018 at 2:05 PM
Salemetsiz Be,

Grazie! Grazie! Grazie! Your blog is indeed quite interesting around No. 13 - First Character Appearing Only Once I agree with you on lot of points!

Since yesterday my EC2 instance in Singapore has become inaccessible to some US customers. Does anyone else have similar problems? AWS Training

Please keep providing such valuable information.

Cheers,
Ajeeth
ReplyDelete
Replies

Add comment