I've been reading around and it seems there is no very well coherent and fully accepted terminology for the URL parts. Is that true? I'd like to know which standards exists for URL parts terminology. What is the most common? Is there any well established standard? I found the following: <ol> <li>RFC3986 section 3</li> </ol> <hr> <pre class="prettyprint"><code> foo://example.com:8042/over/there?name=ferret#nose \_/ \______________/\_________/ \_________/ \__/ | | | | | scheme authority path query fragment | _____________________|__ / \ / \ urn:example:animal:ferret:nose </code></pre> <ol start="2"> <li> <code>window.location</code> from Javascript on browsers</li> </ol> <hr> <pre class="prettyprint"><code>protocol://username:password@hostname:port/pathname?search#hash -----------------------------href------------------------------ -----host---- ----------- origin ------------- </code></pre> <ul> <li> <code>protocol</code> - protocol scheme of the URL, including the final ':'</li> <li> <code>hostname</code> - domain name</li> <li> <code>port</code> - port number</li> <li> <code>pathname</code> - /pathname</li> <li> <code>search</code> - ?parameters</li> <li> <code>hash</code> - #fragment_identifier</li> <li> <code>username</code> - username specified before the domain name</li> <li> <code>password</code> - password specified before the domain name</li> <li> <code>href</code> - the entire URL</li> <li> <code>origin</code> - protocol://hostname:port</li> <li> <code>host</code> - hostname:port</li> </ul> <ol start="3"> <li>NodeJS, module <code>url</code> </li> </ol> <hr> Above the line with the URL you see node's <code>url</code> module old API, whilst under the line you see the new API. It seems node shifted from a RFC standard terminology to a more browser friendly standard terminology, that is, similar to browser's <code>windows.location</code>. <pre class="prettyprint"><code>┌────────────────────────────────────────────────────────────────────────────────────────────────┐ │ href │ ├──────────┬──┬─────────────────────┬────────────────────────┬───────────────────────────┬───────┤ │ protocol │ │ auth │ host │ path │ hash │ │ │ │ ├─────────────────┬──────┼──────────┬────────────────┤ │ │ │ │ │ hostname │ port │ pathname │ search │ │ │ │ │ │ │ │ ├─┬──────────────┤ │ │ │ │ │ │ │ │ │ query │ │ " https: // user : pass @ sub.example.com : 8080 /p/a/t/h ? query=string #hash " │ │ │ │ │ hostname │ port │ │ │ │ │ │ │ │ ├─────────────────┴──────┤ │ │ │ │ protocol │ │ username │ password │ host │ │ │ │ ├──────────┴──┼──────────┴──────────┼────────────────────────┤ │ │ │ │ origin │ │ origin │ pathname │ search │ hash │ ├─────────────┴─────────────────────┴────────────────────────┴──────────┴────────────────┴───────┤ │ href │ └────────────────────────────────────────────────────────────────────────────────────────────────┘ </code></pre> <ol start="4"> <li>Highly ranked article from Matt Cutts</li> </ol> <hr> <code>URL: http://video.google.co.uk:80/videoplay?docid=-7246927612831078230&hl=en#00h02m30s</code> <ul> <li>The protocol is http. Other protocols include https, ftp, etc.</li> <li>The host or hostname is video.google.co.uk.</li> <li>The subdomain is video.</li> <li>The domain name is google.co.uk.</li> <li>The top-level domain or TLD is uk. The uk domain is also referred to as a country-code top-level domain or ccTLD. For google.com, the TLD would be com.</li> <li>The second-level domain (SLD) is co.uk.</li> <li>The port is 80, which is the default port for web servers. Other ports are possible; a web server can listen on port 8000, for example. When the port is 80, most people leave out the port.</li> <li>The path is /videoplay. Path typically refers to a file or location on the web server, e.g. /directory/file.html</li> <li>This URL has parameters. The name of one parameter is docid and the value of that parameter is 7246927612831078230. URLs can have lots parameters. Parameters start with a question mark (?) and are separated with an ampersand (&).</li> </ul> <hr> Some of my concerns: <ol> <li> Is <code>window.location</code> a standard or based on a standard? </li> <li> Shall I call <code>http://</code> the <code>protocol</code> or the <code>scheme</code>? </li> <li> Shall I say <code>host</code> or <code>authority</code>? </li> <li> Why nor <code>window.location</code> nor node have properties for TLD or other domain parts, when available? </li> <li> The terminological difference between <code>hostname</code> (example.com) and <code>host</code> (example.com:8080) is well established? </li> <li> for node <code>origin</code> does not include <code>username:password@</code> whilst for <code>windows.location</code> it does </li> </ol> I'd like to follow on my code a well established standard or best practises.

The URI standard is STD 66. This is currently mapped to RFC 3986. So for the generic URI syntax, these terms are authoritative, currently: <ul> <li><code>scheme</code></li> <li><code>authority</code></li> <li><code>userinfo</code></li> <li><code>host</code></li> <li><code>port</code></li> <li><code>path</code></li> <li><code>query</code></li> <li><code>fragment</code></li> </ul>

URL parts canonical terminology

Tags:

url

terminology

I've been reading around and it seems there is no very well coherent and fully accepted terminology for the URL parts. Is that true? I'd like to know which standards exists for URL parts terminology. What is the most common? Is there any well established standard?

I found the following:

RFC3986 section 3

     foo://example.com:8042/over/there?name=ferret#nose
     \_/   \______________/\_________/ \_________/ \__/
      |           |            |            |        |
   scheme     authority       path        query   fragment
      |   _____________________|__
     / \ /                        \
     urn:example:animal:ferret:nose

window.location from Javascript on browsers

protocol://username:password@hostname:port/pathname?search#hash
-----------------------------href------------------------------
                             -----host----
-----------      origin      -------------

protocol - protocol scheme of the URL, including the final ':'
hostname - domain name
port - port number
pathname - /pathname
search - ?parameters
hash - #fragment_identifier
username - username specified before the domain name
password - password specified before the domain name
href - the entire URL
origin - protocol://hostname:port
host - hostname:port

NodeJS, module url

Above the line with the URL you see node's url module old API, whilst under the line you see the new API. It seems node shifted from a RFC standard terminology to a more browser friendly standard terminology, that is, similar to browser's windows.location.

┌────────────────────────────────────────────────────────────────────────────────────────────────┐
│                                              href                                              │
├──────────┬──┬─────────────────────┬────────────────────────┬───────────────────────────┬───────┤
│ protocol │  │        auth         │          host          │           path            │ hash  │
│          │  │                     ├─────────────────┬──────┼──────────┬────────────────┤       │
│          │  │                     │    hostname     │ port │ pathname │     search     │       │
│          │  │                     │                 │      │          ├─┬──────────────┤       │
│          │  │                     │                 │      │          │ │    query     │       │
"  https:   //    user   :   pass   @ sub.example.com : 8080   /p/a/t/h  ?  query=string   #hash "
│          │  │          │          │    hostname     │ port │          │                │       │
│          │  │          │          ├─────────────────┴──────┤          │                │       │
│ protocol │  │ username │ password │          host          │          │                │       │
├──────────┴──┼──────────┴──────────┼────────────────────────┤          │                │       │
│   origin    │                     │         origin         │ pathname │     search     │ hash  │
├─────────────┴─────────────────────┴────────────────────────┴──────────┴────────────────┴───────┤
│                                              href                                              │
└────────────────────────────────────────────────────────────────────────────────────────────────┘

Highly ranked article from Matt Cutts

URL: http://video.google.co.uk:80/videoplay?docid=-7246927612831078230&hl=en#00h02m30s

The protocol is http. Other protocols include https, ftp, etc.
The host or hostname is video.google.co.uk.
The subdomain is video.
The domain name is google.co.uk.
The top-level domain or TLD is uk. The uk domain is also referred to as a country-code top-level domain or ccTLD. For google.com, the TLD would be com.
The second-level domain (SLD) is co.uk.
The port is 80, which is the default port for web servers. Other ports are possible; a web server can listen on port 8000, for example. When the port is 80, most people leave out the port.
The path is /videoplay. Path typically refers to a file or location on the web server, e.g. /directory/file.html
This URL has parameters. The name of one parameter is docid and the value of that parameter is 7246927612831078230. URLs can have lots parameters. Parameters start with a question mark (?) and are separated with an ampersand (&).

Some of my concerns:

Is window.location a standard or based on a standard?
Shall I call http:// the protocol or the scheme?
Shall I say host or authority?
Why nor window.location nor node have properties for TLD or other domain parts, when available?
The terminological difference between hostname (example.com) and host (example.com:8080) is well established?
for node origin does not include username:password@ whilst for windows.location it does

I'd like to follow on my code a well established standard or best practises.

645

asked Feb 16 '19 11:02

João Pimentel Ferreira

2 Answers

The URI standard is STD 66. This is currently mapped to RFC 3986.

So for the generic URI syntax, these terms are authoritative, currently:

scheme
authority
userinfo
host
port
path
query
fragment

155

answered Oct 18 '22 23:10

unor

Terminology depends on which architectural style/technology you are using.

I prefer REST style for identifying different parts of my url REST URI Standard

But I repeat again there are no single universal standard to represent URL

answered Oct 18 '22 23:10

sandesh dahake

Related questions
                            
                                Unidentified 404 not found pages
                            
                                Get URL of all items tapped with WKWebView
                            
                                Uri.builder vs string based url construction
                            
                                access forbidden on url containing colon symbol, ":", on apache windows
                            
                                How to get an URL location from a server path for a property file
                            
                                How to Drag & Drop from Spotify to Winforms app
                            
                                Multiple query parameters with same name
                            
                                Server.transfer changing the URL a second time
                            
                                How to pass parameter to exe downloaded from web?
                            
                                URL fingerprint caching on Amazon S3
                            
                                MVC Routing Html.ActionLink creates URLs with ?id=1 instead of /id
                            
                                Parsing JSON string from URL (RESTful webservice) using GSON libraries. Android
                            
                                arabic query string - url did not stored and displayed
                            
                                Google Guice & Jersey multiple URL patterns to same servlet while applying package filtering
                            
                                What does this websocket url "ws://{{$}}/ws" mean?
                            
                                Why is this %2B string being urldecoded?
                            
                                Send innerHTML text in URL in JSP
                            
                                How to parse and decode URI in Java to URI components?
                            
                                Removing CNAME on Github
                            
                                Problems with URL handling with Spring Boot and Angular 2

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With