Updated: Monday, October 16,2023-10-16 12:42:42

main
Shwetha Jayaraj 2023-10-16 12:42:43 -04:00
parent 2d69da9a74
commit 4e5db77998
3 changed files with 15 additions and 12 deletions

@@ -1,13 +1,13 @@
 {
 "recentFiles": [
-{
-"basename": "Webscraping",
-"path": "Coding Tips (Classical)/Terminal Tips/GUIs/Tools/Webscraping.md"
-},
 {
 "basename": "Robots.txt Files",
 "path": "Coding Tips (Classical)/Terminal Tips/GUIs/Internet/Websites/Robots.txt Files.md"
 },
+{
+"basename": "Webscraping",
+"path": "Coding Tips (Classical)/Terminal Tips/GUIs/Tools/Webscraping.md"
+},
 {
 "basename": "Potentiometers & Analog SerialReader",
 "path": "Machine Tips (Quantum)/Physics/Hardware/Potentiometers & Analog SerialReader.md"

@@ -25,7 +25,7 @@
 "state": {
 "type": "markdown",
 "state": {
-"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Tools/Webscraping.md",
+"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Internet/Websites/Robots.txt Files.md",
 "mode": "source",
 "source": false
 }
@@ -107,7 +107,7 @@
 "state": {
 "type": "backlink",
 "state": {
-"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Tools/Webscraping.md",
+"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Internet/Websites/Robots.txt Files.md",
 "collapseAll": false,
 "extraContext": false,
 "sortOrder": "alphabetical",
@@ -124,7 +124,7 @@
 "state": {
 "type": "outgoing-link",
 "state": {
-"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Tools/Webscraping.md",
+"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Internet/Websites/Robots.txt Files.md",
 "linksCollapsed": false,
 "unlinkedCollapsed": true
 }
@@ -147,7 +147,7 @@
 "state": {
 "type": "outline",
 "state": {
-"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Tools/Webscraping.md"
+"file": "Coding Tips (Classical)/Terminal Tips/GUIs/Internet/Websites/Robots.txt Files.md"
 }
 }
 }
@@ -174,8 +174,9 @@
 "obsidian-excalidraw-plugin:Create new drawing": false
 }
 },
-"active": "dbad7b010371d947",
+"active": "0a0de85a51848b9d",
 "lastOpenFiles": [
+"Coding Tips (Classical)/Terminal Tips/GUIs/Tools/Webscraping.md",
 "Coding Tips (Classical)/Terminal Tips/GUIs/Internet/Websites/Robots.txt Files.md",
 "Excalidraw/Drawing 2023-10-16 12.13.42.excalidraw.md",
 "Machine Tips (Quantum)/Physics/Hardware/Potentiometers & Analog SerialReader.md",
@@ -206,7 +207,6 @@
 "Untitled.canvas",
 "Coding Tips (Classical)/Project Vault/Current Occupations/Manhattan Youth",
 "Coding Tips (Classical)/Project Vault/Current Occupations/Website Projects/My Domain Names.md",
-"Coding Tips (Classical)/Project Vault/Current Occupations/Potential and Future/Career Tips.md",
 "Coding Tips (Classical)/Project Vault/About Obsidian/imgFiles/Pasted image 20231011091043.png",
 "Coding Tips (Classical)/Project Vault/About Obsidian/Slides & Tools/export/Slides/plugin/chalkboard/_style.css",
 "Coding Tips (Classical)/Project Vault/About Obsidian/Slides & Tools/export/Slides/plugin/chalkboard/img/blackboard.png",

@@ -1,6 +1,9 @@
-Robots.txt is an increasingly important file found on websites that determines whether you permit a website crawler to index your page for search engine optimization. As webscraping is entirely legal in the US, this is the wild west of scraping and thus I want to keep my brain and information safe from scraping.
+Robots.txt is an increasingly important file found on websites that determines whether you permit a website crawler to index your page for search engine optimization. As web-scraping is entirely legal in the US, this is the wild west of scraping and thus I want to keep my brain and information safe from scraping.
+Fun Fact: Google [open-sourced](https://opensource.googleblog.com/2019/07/googles-robotstxt-parser-is-now-open.html) their [robots.txt parser](https://github.com/google/robotstxt) in 2019 if you want to see an example of reverse engineering the robots.txt file for search indexing.
 *Resources*:
 - [Robots.txt file examples](https://blog.hubspot.com/marketing/robots-txt-file)
+- Robots.txt [generator tool](https://www.internetmarketingninjas.com/tools/robots-txt-generator/)
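
To make the note above concrete, here is a minimal sketch of how a well-behaved crawler might consult a robots.txt file before fetching a page, using Python's standard-library `urllib.robotparser`. The rules, the `MyCrawler` and `BadBot` agent names, and the `example.com` URLs are all made up for illustration and do not come from the note. Keep in mind that robots.txt is advisory: it only restrains crawlers that choose to honor it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, invented purely for illustration.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/

User-agent: BadBot
Disallow: /
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(SAMPLE_ROBOTS_TXT)

# A generic crawler falls under the "*" group: everything except /private/.
print(parser.can_fetch("MyCrawler", "https://example.com/notes/index.html"))
print(parser.can_fetch("MyCrawler", "https://example.com/private/diary.html"))

# BadBot has its own group that disallows the whole site.
print(parser.can_fetch("BadBot", "https://example.com/notes/index.html"))
```

For a live site, the same parser can load the file over the network with `set_url(...)` followed by `read()` instead of `parse(...)`.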