Radix sort (#28)

* Add in-place sort to QuickSort.js

* Fix linting errors and clean up comments

* Change implementation to address lint errors

* Trailing space and undefined variable

* Create own class for in-place quicksort and use tests

* Add trailing space at end of file

* Fix placement of visitedCallback, explain itial destructuring

* Implement Radix Sort

Remove excess line

Reorganize RadixSort of match structure of other classes

Write tests for RadixSort and additional test constants

Create README

Update main readme to include radix sort
This commit is contained in:
Robert Taussig 2018-05-29 01:38:19 -04:00 committed by Oleksii Trekhleb
parent cca138a384
commit 71985337b5
5 changed files with 150 additions and 0 deletions

View File

@ -81,6 +81,7 @@ a set of rules that precisely defines a sequence of operations.
* [Quicksort](https://github.com/trekhleb/javascript-algorithms/tree/master/src/algorithms/sorting/quick-sort) - in-place and non-in-place implementations
* [Shellsort](https://github.com/trekhleb/javascript-algorithms/tree/master/src/algorithms/sorting/shell-sort)
* [Counting Sort](https://github.com/trekhleb/javascript-algorithms/tree/master/src/algorithms/sorting/counting-sort)
* [Radix Sort](https://github.com/trekhleb/javascript-algorithms/tree/master/src/algorithms/sorting/radix-sort)
* **Tree**
* [Depth-First Search](https://github.com/trekhleb/javascript-algorithms/tree/master/src/algorithms/tree/depth-first-search) (DFS)
* [Breadth-First Search](https://github.com/trekhleb/javascript-algorithms/tree/master/src/algorithms/tree/breadth-first-search) (BFS)
@ -227,3 +228,4 @@ Below is the list of some of the most used Big O notations and their performance
| **Quick sort** | n log(n) | n log(n) | n^2 | log(n) | No | |
| **Shell sort** | n log(n) | depends on gap sequence | n (log(n))^2 | 1 | No | |
| **Counting sort** | n + r | n + r | n + r | n + r | Yes | r - biggest number in array |
| **Radix sort** | n * k | n * k | n * k | n + k | Yes | k - length of longest key |

View File

@ -2,6 +2,8 @@ export const sortedArr = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
export const reverseArr = [20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1];
export const notSortedArr = [15, 8, 5, 12, 10, 1, 16, 9, 11, 7, 20, 3, 2, 6, 17, 18, 4, 13, 14, 19];
export const equalArr = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1];
export const stringArr = ['zzz', 'bb', 'a', 'rr', 'rrb', 'rrba'];
export const intArr = [3, 1, 75, 32, 884, 523, 4343456, 232, 123, 656, 343];
export class SortTester {
static testSort(SortingClass) {

View File

@ -0,0 +1,13 @@
# Radix Sort
In computer science, radix sort is a non-comparative integer sorting algorithm that sorts data with integer keys by grouping keys by the individual digits which share the same significant position and value. A positional notation is required, but because integers can represent strings of characters (e.g., names or dates) and specially formatted floating point numbers, radix sort is not limited to integers. Radix sort dates back as far as 1887 to the work of Herman Hollerith on tabulating machines.
## Efficiency
The topic of the efficiency of radix sort compared to other sorting algorithms is somewhat tricky and subject to quite a lot of misunderstandings. Whether radix sort is equally efficient, less efficient or more efficient than the best comparison-based algorithms depends on the details of the assumptions made. Radix sort complexity is O(wn) for n keys which are integers of word size w. Sometimes w is presented as a constant, which would make radix sort better (for sufficiently large n) than the best comparison-based sorting algorithms, which all perform O(n log n) comparisons to sort n keys. However, in general w cannot be considered a constant: if all n keys are distinct, then w has to be at least log n for a random-access machine to be able to store them in memory, which gives at best a time complexity O(n log n).[2] That would seem to make radix sort at most equally efficient as the best comparison-based sorts (and worse if keys are much longer than log n).
The counter argument is that comparison-based algorithms are measured in number of comparisons, not actual time complexity. Under some assumptions the comparisons will be constant time on average, under others they will not. Comparisons of randomly generated keys takes constant time on average, as keys differ on the very first bit in half the cases, and differ on the second bit in half of the remaining half, and so on, resulting in an average of two bits that need to be compared. In a sorting algorithm the first comparisons made satisfies the randomness condition, but as the sort progresses the keys compared are clearly not randomly chosen anymore. For example, consider a bottom-up merge sort. The first pass will compare pairs of random keys, but the last pass will compare keys that are very close in the sorting order. This makes merge sort, on this class of inputs, take O(n (log n)2) time. That assumes all memory accesses cost the same, which is not a physically reasonable assumption as we scale n to infinity, and not, in practice, how real computers work.
## References
- [Wikipedia](https://en.wikipedia.org/wiki/Radix_sort)

View File

@ -0,0 +1,102 @@
import Sort from '../Sort';
export default class RadixSort extends Sort {
sort(originalArray) {
const isNumber = (element) => {
return Number.isInteger(element);
};
const createBuckets = (numBuckets) => {
/**
* Mapping buckets to an array instead of filling them with
* an array prevents each bucket from containing a reference to the same array
*/
return new Array(numBuckets).fill(null).map(() => []);
};
const placeElementsInNumberBuckets = (array, index) => {
// See below. These are used to determine which digit to use for bucket allocation
const modded = 10 ** (index + 1);
const divided = 10 ** index;
const buckets = createBuckets(10);
array.forEach((element) => {
this.callbacks.visitingCallback(element);
if (element < divided) {
buckets[0].push(element);
} else {
/**
* Say we have element of 1,052 and are currently on index 1 (starting from 0). This means
* we want to use '5' as the bucket. `modded` would be 10 ** (1 + 1), which
* is 100. So we take 1,052 % 100 (52) and divide it by 10 (5.2) and floor it (5).
*/
const currentDigit = Math.floor((element % modded) / divided);
buckets[currentDigit].push(element);
}
});
return buckets;
};
const placeElementsInCharacterBuckets = (array, index, numPasses) => {
const getCharCodeOfElementAtIndex = (element) => {
// Place element in last bucket if not ready to organize
if ((numPasses - index) > element.length) return 25;
// Using charCode (a = 97, b = 98, etc), we can map characters to buckets from 0 - 25
const BASE_CHAR_CODE = 97;
/**
* If each character has been organized, use first character to determine bucket,
* otherwise iterate backwards through element
*/
const charPos = index > element.length - 1 ? 0 : element.length - index - 1;
return element.toLowerCase().charCodeAt(charPos) - BASE_CHAR_CODE;
};
const buckets = createBuckets(26);
array.forEach((element) => {
this.callbacks.visitingCallback(element);
const currentBucket = getCharCodeOfElementAtIndex(element);
buckets[currentBucket].push(element);
});
return buckets;
};
// Assumes all elements of array are of the same type
const isArrayOfNumbers = isNumber(originalArray[0]);
/** Number of passes is determined by the length of the longest element in the array.
* For integers, this log10(num), and for strings, this would be the lenght of the string.
*/
const determineNumPasses = () => {
const getLengthOfLongestElement = () => {
if (isArrayOfNumbers) {
return Math.floor(Math.log10(Math.max(...originalArray))) + 1;
}
return originalArray.reduce((acc, val) => {
return val.length > acc ? val.length : acc;
}, -Infinity);
};
return getLengthOfLongestElement(originalArray);
};
let sortedArray = [...originalArray];
const numPasses = determineNumPasses();
for (let currentIndex = 0; currentIndex < numPasses; currentIndex += 1) {
const buckets = isArrayOfNumbers ?
placeElementsInNumberBuckets(sortedArray, currentIndex) :
placeElementsInCharacterBuckets(sortedArray, currentIndex, numPasses);
// Flatten buckets into sortedArray, and repeat at next index
sortedArray = buckets.reduce((acc, val) => {
return [...acc, ...val];
}, []);
}
return sortedArray;
}
}

View File

@ -0,0 +1,31 @@
import RadixSort from '../RadixSort';
import {
stringArr,
intArr,
SortTester,
} from '../../SortTester';
// Complexity constants.
const ARRAY_OF_STRINGS_VISIT_COUNT = 24;
const ARRAY_OF_INTEGERS_VISIT_COUNT = 77;
describe('RadixSort', () => {
it('should sort array', () => {
SortTester.testSort(RadixSort);
});
it('should visit array of strings n (number of strings) x m (length of longest element) times', () => {
SortTester.testAlgorithmTimeComplexity(
RadixSort,
stringArr,
ARRAY_OF_STRINGS_VISIT_COUNT,
);
});
it('should visit array of integers n (number of elements) x m (length of longest integer) times', () => {
SortTester.testAlgorithmTimeComplexity(
RadixSort,
intArr,
ARRAY_OF_INTEGERS_VISIT_COUNT,
);
});
});