Tumgik
#data science process
cogitotech · 1 year
Link
0 notes
d0nutzgg · 9 months
Text
Tumblr media
Tonight I am hunting down venomous and nonvenomous snake pictures that are under the creative commons of specific breeds in order to create one of the most advanced, in depth datasets of different venomous and nonvenomous snakes as well as a test set that will include snakes from both sides of all species. I love snakes a lot and really, all reptiles. It is definitely tedious work, as I have to make sure each picture is cleared before I can use it (ethically), but I am making a lot of progress! I have species such as the King Cobra, Inland Taipan, and Eyelash Pit Viper among just a few! Wikimedia Commons has been a huge help!
I'm super excited.
Hope your nights are going good. I am still not feeling good but jamming + virtual snake hunting is keeping me busy!
40 notes · View notes
tagapagsalaysay · 1 year
Text
Tumblr media Tumblr media
Irene and Stanley Attempt Data Analytics by Hand
61 notes · View notes
Text
So deep in the methodology section that the next time someone even mention the word causal inference I am going to start chewing on their laptop without any explanation.
27 notes · View notes
cephalog0d · 9 months
Text
Hey, Batfans, are you a weird completionist who likes sortable data way too much, like me? Do you want a giant spreadsheet of appearances of a bunch of Batfam members that you can filter across multiple people to see where they show up in the same issue together?
WELL GOOD NEWS because thanks to my hyperfixation I made a spreadsheet. (It's view-only but if you save a copy it should be editable for personal use.)
The Spreadsheet currently contains all post-crisis appearances for the following characters: Barbara Gordon, Cassandra Cain, Damian Wayne, Dick Grayson, Duke Thomas, Harper Row, Helena Bertinelli, Jace Fox, Jarro, Jason Todd, Jean-Paul Valley, Kate Kane, Luke Fox, Stephanie Brown and Tim Drake. I feel like that's most of the big ones (and several not-very-big-ones), but if there's a Bat-person missing you'd like to see on there, feel free to ask!
This is up to date through July 2023. I have intentions to keep updating it on a semi-monthly basis, but we'll see if that happens.
All sheets are conditionally formatted so if you enter "Y" in the Read column it will highlight the whole row in green to mark it off, if you're the kind of person who likes to keep track and mark things off a list.
The Master List is filterable by any character, and more importantly, multiple characters! Up to "all of them" although I don't think anything actually contains *all* of them.
(Some more notes below)
SOME NOTES:
Dates are the start of the series, since that's how a lot of places besides DC itself with their weird "volume" convention distinguish different runs.
These aren't sorted by preboot vs. New 52 vs. Rebirth vs. IF, sorry, that was too many sorting functions for now. You can kinda figure it out by date, though (New 52 was 2011, Rebirth was 2016, IF was 2022) or look up the issue on a wiki and see what version of the character is tagged.
On that note, all of this was pulled from the DC Wiki, and while I did a little bit of spot-checking as I went for things I knew off the top of my head it's entirely possible things are missing or mis-attributed. I'm happy to update accordingly if there are.
Similarly, I didn't go through every issue here to check what role people are appearing in, either in terms of what identity they're using (e.g. Spoiler vs Robin vs Batgirl) or if they're a major character or not. Some of these are as minor as background appearances or off-screen mentions. Some day I might add more metadata to sort for those things, but right now that's not part of it.
14 notes · View notes
Text
Anyone good at data analysis pls give me tips on how to explain the process for an interview bc literally I black out for 6 hours and when I come to the analysis is done and I don't remember how I did anything. This is not something interviewers want to hear.
4 notes · View notes
techmeright · 4 months
Text
How To Use Perplexity AI And Its Top 5 Features
Perplexity AI makes use of artificial intelligence to help users locate and retrieve information, doing away with the need for tiresome hours spent searching the internet and viewing sites. In contrast to well-known AI chatbots such as ChatGPT, Perplexity serves as a real-time internet search engine that looks up answers to user inquiries.Perplexity can respond to a variety of questions, offer…
Tumblr media
View On WordPress
2 notes · View notes
ephemeral-winter · 5 months
Text
definitely love to meet with my advisor a week before my capstone project is due and have it become clear that he did not read the 3800 word partial draft i sent him two weeks ago and is instead still referring to an outline i wrote in october. definitely makes me feel that he knows what i am doing and what my project is and how to help me
2 notes · View notes
bits-of-ds · 1 year
Text
Cosine Similarity; For checking similarity of documents, etc.
Cosine similarity is a measure of checking the similarity between two documents, texts, strings, etc.
It does so by representing the query as vectors in n-dimensional space. It then measures the angle between these vectors and gives the similarity based on the cosine of this angle.
If the queries are completely similar the angle will be zero; Thus the cosine similarity will be: > cos(angle_between_the _vectors)=cos(0)= 1
If the queries are completely dissimilar the vectors will be perpendicular; Thus the cosine similarity will be: > cos(angle_between_the _vectors)=cos(90)= 0
If the queries are completely opposite the vectors will be opposite to each other; Thus the cosine similarity will be: > cos(angle_between_the _vectors)=cos(180)= -1
The cosine similarity, mathematically, is given by:
Tumblr media
Let's see an example:
Doc1 = "this is the first document" Doc2 = "this document is second in this order"
Vector representation of these documents: Doc1 = [1,1,1,0,1,1,0,0] Doc2 = [1,0,1,1,1,2,1,1]
Tumblr media
ΣAiBi = (1*1)+(1*0)+(0*1)+(1*1)+(0*1)+(0*1)+(1*0)+(1*2) = 4 √(ΣAi)^2 = √(1+1+0+1+0+0+1+1) = √5 √(ΣBi)^2 = √(1+0+1+1+1+1+0+4) = √9
Cosine similarity = 4/(√5*√9) = 0.59
The Cosine Similarity is a better metric than Euclidean distance because if the two text document far apart by Euclidean distance, there are still chances that they are close to each other in terms of their context.
12 notes · View notes
tarathecogsci · 2 years
Text
looking for studyblrs / blogs
leave a message if you post about
deep learning applications specifically for NLP
deep reinforcement learning
data science and data analytics
regression models
tensorflow
computational linguistics (bonus if it's about emergent languages)
practical ethics of cognitive science applications
cognitive neuropsychology
or if you are just cool in general <3
Will leave a follow in return! I need new active blogs to follow
30 notes · View notes
datascienceunicorn · 1 year
Text
11 notes · View notes
nellectronic · 3 months
Text
someone needs to bully me into working on my thesis
1 note · View note
d0nutzgg · 1 year
Text
Analyzing An Ataxic Dysarthria Patient's Speech with Computer Vision and Audio Processing
Hey everyone, so as you know I have been doing research on patients like myself who have Ataxic Dysarthria and other neurological speech disorders related to diseases and conditions that affect the brain. I was analyzing this file
with a few programs that I have written.
The findings are very informative and I am excited that I am able to explain this to my Tumblr following as I feel it not only promotes awareness but provides an understanding of what we go through with Ataxic Dysarthria.
Analysis of the audio file with an Intonation Visualizer I built
Tumblr media
As you can tell this uses a heatmap to visualize loudness and softness of a speaker's voice. I used it to analyze the file and I found some really interesting and telling signs of Ataxic Dysarthria
Tumblr media
At 0-1 seconds it is mostly pretty quiet (which is normal because it is harder for patients with AD to start their speaking off. You can notice that around 1-3 seconds it gets louder, and then when she speaks its clearer and louder than the patients voice. However the AD makes the patients speech constantly rise and fall in loudness from around -3 to 0 decibels most of the audio when the patient is speaking. The variation though between 0 and -3 varies quickly though which is a common characteristic in AD
The combination of the constant rising and falling in loudness and intonation as well as problems getting sentences started is one of the things that makes it so hard for people to understand those with Ataxic Dysarthria.
The second method I used is using a line graph (plotted) that gives an example of the rate of speech and elongated syllables of the patient.
Tumblr media
As you can see I primarily used the Google Speech Recognition library to transcribe and count the syllables using Pyphen via "hyphenated" (elongated) words in the speech of the patient. This isn't the most effective method but it worked well for this example and here is the results plotted out using Matplotlib:
Tumblr media
As you can see when they started talking at first there was a rise from the softer speech, as the voice of the patient got louder, they were speaking faster (common for those with AD / and HD) my hypothesis (and personal experience) is that this is how we try to get our words out where we can be understood by "forcing" out words resulting in a rise and fall of syllables / rate of speech that we see at the first part. The other spikes typically happen when she speaks but there is another spike at the end which you can see as well when the patient tries to force more words out.
This research already indicates a pretty clear pattern what is going on in the patients speech. As they try to force out words, their speech gets faster and thus gets louder as they try to communicate.
I hope this has been informative for those who don't know much about speech pathology or neurological diseases. I know it's already showing a lot of exciting progress and I am continuing to develop scripts to further research on this subject so maybe we can all understand neurological speech disorders better.
As I said, I will be posting my research and findings as I go. Thank you for following me and keeping up with my posts!
38 notes · View notes
redbixbite-solutions · 9 months
Text
Everything You Need to Know About Machine Learning
Ready to step into the world of possibilities with machine learning? Learn all about machine learning and its cutting-edge technology. From what do you need to learn before using it to where it is applicable and their types, join us as we reveal the secrets. Read along for everything you need to know about Machine Learning!
Tumblr media
What is Machine Learning?
Machine Learning is a field of study within artificial intelligence (AI) that concentrates on creating algorithms and models which enable computers to learn from data and make predictions or decisions without being explicitly programmed. The process involves training a computer system using copious amounts of data to identify patterns, extract valuable information, and make precise predictions or decisions.
Fundamentally, machine Learning relies on statistical techniques and algorithms to analyze data and discover patterns or connections. These algorithms utilize mathematical models to process and interpret data. Revealing significant insights that can be utilized across various applications by different AI ML services.
What do you need to know for Machine Learning?
You can explore the exciting world of machine learning without being an expert mathematician or computer scientist. However, a  basic understanding of statistics, programming, and data manipulation will benefit you. Machine learning involves exploring patterns in data, making predictions, and automating tasks.
 It has the potential to revolutionize industries. Moreover, it can improve healthcare and enhance our daily lives. Whether you are a beginner or a seasoned professional embracing machine learning can unlock numerous opportunities and empower you to solve complex problems with intelligent algorithms.
Types of Machine Learning
Let’s learn all about machine learning and know about its types.
Supervised Learning
Supervise­d learning resemble­s having a wise mentor guiding you eve­ry step of the way. In this approach, a machine le­arning model is trained using labele­d data wherein the de­sired outcome is already known.
The­ model gains knowledge from the­se provided example­s and can accurately predict or classify new, unse­en data. It serves as a highly e­ffective tool for tasks such as dete­cting spam, analyzing sentiment, and recognizing image­s.
Unsupervised Learning
In the re­alm of unsupervised learning, machine­s are granted the autonomy to e­xplore and unveil patterns inde­pendently. This methodology mainly ope­rates with unlabeled data, whe­re models strive to une­arth concealed structures or re­lationships within the information.
It can be likene­d to solving a puzzle without prior knowledge of what the­ final image should depict. Unsupervise­d learning finds frequent application in dive­rse areas such as clustering, anomaly de­tection, and recommendation syste­ms.
Reinforcement Learning
Reinforce­ment learning draws inspiration from the way humans le­arn through trial and error. In this approach, a machine learning mode­l interacts with an environment and acquire­s knowledge to make de­cisions based on positive or negative­ feedback, refe­rred to as rewards.
It's akin to teaching a dog ne­w tricks by rewarding good behavior. Reinforce­ment learning finds exte­nsive applications in areas such as robotics, game playing, and autonomous ve­hicles.
Machine Learning Process
Now that the diffe­rent types of machine le­arning have been e­xplained, we can delve­ into understanding the encompassing proce­ss involved.
To begin with, one­ must gather and prepare the­ appropriate data. High-quality data is the foundation of any triumph in a machine le­arning project.
Afterward, one­ should proceed by sele­cting an appropriate algorithm or model that aligns with their spe­cific task and data type. It is worth noting that the market offe­rs a myriad of algorithms, each possessing unique stre­ngths and weaknesses.
Next, the machine goes through the training phase. The model learns from making adjustments to its internal parameters and labeled data. This helps in minimizing errors and improves its accuracy.
Evaluation of the machine’s performance is a significant step. It helps assess machines' ability to generalize new and unforeseen data. Different types of metrics are used for the assessment. It includes measuring accuracy, recall, precision, and other performance indicators.
The last step is to test the machine for real word scenario predictions and decision-making. This is where we get the result of our investment. It helps automate the process, make accurate forecasts, and offer valuable insights. Using the same way. RedBixbite offers solutions like DOCBrains, Orionzi, SmileeBrains, and E-Governance for industries like agriculture, manufacturing, banking and finance, healthcare, public sector and government, travel transportation and logistics, and retail and consumer goods.
Applications of Machine Learning
Do you want to know all about machine learning? Then you should know where it is applicable.
Natural Language Processing (NLP)- One are­a where machine le­arning significantly impacts is Natural Language Processing (NLP). It enables various applications like­ language translation, sentiment analysis, chatbots, and voice­ assistants. Using the prowess of machine le­arning, NLP systems can continuously learn and adapt to enhance­ their understanding of human language ove­r time.
Computer Vision- Computer Vision pre­sents an intriguing application of machine learning. It involve­s training computers to interpret and compre­hend visual information, encompassing images and vide­os. By utilizing machine learning algorithms, computers gain the­ capability to identify objects, faces, and ge­stures, resulting in the de­velopment of applications like facial re­cognition, object detection, and autonomous ve­hicles.
Recommendation Systems- Recomme­ndation systems have become­ an essential part of our eve­ryday lives, with machine learning playing a crucial role­ in their developme­nt. These systems care­fully analyze user prefe­rences, behaviors, and patte­rns to offer personalized re­commendations spanning various domains like movies, music, e­-commerce products, and news article­s.
Fraud Detection- Fraud dete­ction poses a critical concern for businesse­s. In this realm, machine learning has e­merged as a game-change­r. By meticulously analyzing vast amounts of data and swiftly detecting anomalie­s, machine learning models can ide­ntify fraudulent activities in real-time­.
Healthcare- Machine learning has also made great progress in the healthcare sector. It has helped doctors and healthcare professionals make precise and timely decisions by diagnosing diseases and predicting patient outcomes. Through the analysis of patient data, machine learning algorithms can detect patterns and anticipate possible health risks, ultimately resulting in early interventions and enhanced patient care.
In today's fast-paced te­chnological landscape, the field of artificial inte­lligence (AI) has eme­rged as a groundbreaking force, re­volutionizing various industries. As a specialized AI de­velopment company, our expe­rtise lies in machine le­arning—a subset of AI that entails creating syste­ms capable of learning and making predictions or de­cisions without explicit programming.
Machine learning's wide­spread applications across multiple domains have transforme­d businesses' operations and significantly e­nhanced overall efficie­ncy.
3 notes · View notes
zankalony · 1 year
Text
The latest trends and innovations in artificial intelligence
How AI is transforming the world and what it means for you The question of whether a computer can think is no more interesting than the question of whether a submarine can swim.— Edsger W. Dijkstra Artificial intelligence (AI) is one of the most exciting and rapidly evolving fields of technology today. From self-driving cars to smart assistants, AI is changing the way we live, work, and play.…
Tumblr media
View On WordPress
3 notes · View notes
digitalpolarsblog · 1 year
Text
9 notes · View notes