uris - Fork of rhymuri with a small bug fix

Age	Commit message (Collapse)	Author
2018-07-04	Fix bug in truncated host elements	Richard Walters
	For example, "[::1", where the square bracket at the end is missing. Handle truncated host element by checking the state we end up in after the entire string is parsed. Some states represent interal elements of a host name or address, and so if we're still in those states and run out of input characters, the input string was cut off early.
2018-07-04	Fix bug in parsing out IPv6 and IPvFuture addresses	Richard Walters
	Don't include the square brackets in the parsed out host string; they are only there for delimiting them inside of an overall URI string.
2018-07-04	Validate IPv6 addresses	Richard Walters
	* Add ValidateIpv6Address. * Add ValidateIpv4Address (since an IPv6 address is allowed to contain an IPv4 address for compatibility) * Add ValidateOctet (used by ValidateIpv4Address).
2018-07-04	Refactoring	Richard Walters
	* Extract CanNavigatePathUpOneLevel from NormalizePath. * Add comments to explain what's going on elsewhere in NormalizePath.
2018-07-04	Add missing "ok" return values in extracted methods	Richard Walters

2018-07-04	Give names to states in host/port parsing state machine	Richard Walters

2018-07-04	Refactoring	Richard Walters
	* Extract methods to copy various elements of one URI from another. * Push NormalizePath implementation into a private method. * Simplify and consolidate checks for absolute paths. * Extract methods out of individual steps of ParseFromString.
2018-07-03	Document parts of the path normalization process	Richard Walters
	Add comments that link parts of the code back to lines of the pseudocode in the RFC, to make the code easier to understand.
2018-07-03	Complete rewrite of NormalizePath	Richard Walters
	The former algorithm was based on the pseuocode from the RFC, which is hard to follow, more suitable when the path is in a single string, not a sequence of segments. The new algorithm uses two flags: * isAbsolute - recognize that if the path starts out as an absolute path, it needs to stay that way. * atDirectoryLevel - recognize that if we encounter a "." or "..", then it will be reduced by simply discarding it or going back/up one stop, but then we will be in a "directory" context, meaning that should we end the path at this point, there needs to be an empty-string segment to mark that the end of the path is reaching into a directory, not just referring to the directory.
2018-07-02	Add reference resolution and attempt to fix path normalization	Richard Walters
	Path normalization is hideously broken for now.
2018-07-02	Allow default move semantics	Richard Walters

2018-07-02	Recognize special case of absolute URI with empty path	Richard Walters
	Such a URI should be considered equivalent to a path of "/" because in both cases the path is an absolute path.
2018-07-02	Add more path normalization tests and fix a bug in it	Richard Walters
	For normalization "step 2C", if the output path was empty, we don't want to pop the end of it off.
2018-07-02	Add capability to compare Uri objects.	Richard Walters
	* Code the neat example in section 6.2.2 of the RFC. * Add equality/inequality operators for Uri.
2018-07-02	Add NormalizePath method	Richard Walters

2018-07-02	Refactoring	Richard Walters
	Extract methods that parse the query and fragment.
2018-07-02	Refactoring	Richard Walters
	* Replaced the more formal "state machine" used in URI elements that may have percent-encoded characters, with a simpler loop with a flag and a few conditional logic paths. * Extracted the parsing of the above types of elements into a common method, DecodeElement. * Kept DecodeQueryOrFragment around, in order to prevent having to repeat the name of the allowed character set which is common between query and fragment; however the function is now just a very thin wrapper.
2018-07-01	Refactoring	Richard Walters
	* Remove IsCharacterInSet function
2018-07-01	Rename IsCharacterInSet module to CharacterSet	Richard Walters

2018-07-01	Normalize scheme and reg-name elements to lower case	Richard Walters

2018-07-01	Allow HEXDIG to include lower-case 'a'..'f'	Richard Walters

2018-07-01	Refactoring	Richard Walters
	Added CharacterSet as a class to represent character sets, allowing us to build singletons and composite character sets more concisely.
2018-07-01	Refactoring	Richard Walters
	* Extract IsCharacterInSet to its own module. * Extract PercentEncodedCharacterDecoder to its own module.
2018-07-01	Refactoring	Richard Walters
	Remove state 3 hole in host/port parsing state machine
2018-07-01	Refactoring	Richard Walters
	Extract percent-encoded character decoding, so that the logic is all in one class that is reused.
2018-07-01	Added missing documentation	Richard Walters

2018-07-01	Check for illegal characters in query and fragment elements	Richard Walters

2018-07-01	Check for illegal characters in path segments	Richard Walters

2018-07-01	Fix second bug in scheme delimiter searching	Richard Walters
	Path may also have colon, so make sure we don't scan into the path element if there is one.
2018-07-01	Handle bad host names	Richard Walters
	* Detect bad characters in host names. * Incorporate splitting host and port into the state machine that is parsing/decoding the host. NOTE: IPv6address is not checked for bad characters yet. More research is needed to learn exactly what are the various ways to write an IPv6 address.
2018-07-01	Fix bug in parsing scheme	Richard Walters
	A colon may be in the authority, if present, so limit the search for scheme delimiter so we aren't scanning the authority part, when parsing the scheme.
2018-07-01	Handle bad characters in UserInfo	Richard Walters

2018-06-30	Refactoring	Richard Walters
	Extracted IsCharacterInSet function
2018-06-30	Add code to check that scheme, if present, is legal	Richard Walters

2018-06-30	Refactoring	Richard Walters
	Extract method ParseAuthority
2018-06-30	Refactoring	Richard Walters
	Extract method that parses the path segments from the whole path string.
2018-06-30	Refactoring	Richard Walters
	* Extract function that parses 16-bit unsigned integers, to use in parsing port element. * Clean up and clarify what parts of the original URI string are still being held onto at various points in the code.
2018-06-30	Fix bug in not clearing userInfo when there is no authority	Richard Walters

2018-06-30	Add more element parsing of URIs	Richard Walters
	* Add IsRelativeReference. * Add IsRelativePath. * Add Query. * Add Fragment. * Add UserInfo. * Fix parsing of URIs that have no scheme.
2018-06-30	Add support for port and hasPort elements	Richard Walters

2018-06-30	Uri: fix mistakes from last session	Richard Walters
	* Parts of a path are called "segments", not "steps", in the RFC. * The RFC specifies that path separators are always forward slashes, so don't support other separators.
2018-06-30	Kick off Uri component	Richard Walters
	* Can now parse URIs from strings. * This supports scheme, host, and path. * Path separator defaults to "/" but may be customized.
2018-06-02	Initial Revision.	Richard Walters