Normalization for two bulk RNA-Seq samples to enable reliable fold-change estimation between...
Can we use the stored gravitational potential energy of a building to produce power?
How to solve a large system of linear algebra?
Writing a character who is going through a civilizing process without overdoing it?
Why are the books in the Game of Thrones citadel library shelved spine inwards?
Can a hotel cancel a confirmed reservation?
What is the purpose of easy combat scenarios that don't need resource expenditure?
How to say "Brexit" in Latin?
If I deleted a game I lost the disc for, can I reinstall it digitally?
Why exactly do action photographers need high fps burst cameras?
Table formatting top left corner caption
Can you tell from a blurry photo if focus was too close or too far?
Why do members of Congress in committee hearings ask witnesses the same question multiple times?
Would a National Army of mercenaries be a feasible idea?
Finding a mistake using Mayer-Vietoris
How to remove lines through the legend markers in ListPlot?
Why has the mole been redefined for 2019?
One Half of Ten; A Riddle
How much mayhem could I cause as a sentient fish?
Am I a Rude Number?
Citing paywalled articles accessed via illegal web sharing
How to limit sight distance to 1 km
Why would space fleets be aligned?
How to prevent users from executing commands through browser URL
Avoiding morning and evening handshakes
Normalization for two bulk RNA-Seq samples to enable reliable fold-change estimation between genes
Normalization methods with RNA-Seq ERCC spike in?Confirm success or failure RNA-Seq normalizationWhat are the ways to process a list of differentially expressed genes?What methods are available to find a cutoff value for non-expressed genes in RNA-seq?qPCR: Why is fold change and standard deviation calculated after transformation?Order of batch effects removal, data imputation and library size normalization in scRNA-seq dataDetecting differentially expressed genes with foldchange >= 2 and FDR < 0.05Selection of differential expressed genesK mean clustering issueHow to quantile normalization on RNA seq counts
$begingroup$
I have two bulk RNA-Seq samples, already tpm-normalized.
I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.
The distribution of the two samples using the common set of genes looks similar:
However, the two samples have only been tpm-normalized, is it enough to guarantee reliable fold-change estimation? Should I use another normalization procedure, e.g. Quantile Normalization, before comparison?
My objective is to define a signature using the genes that are up-regulated in Sample1 with respect to Sample0, and vice versa. I'm using log fold-changes, but I'm concerned that their value may be affected by each sample distribution.
Do you also have suggestions for the definition of up-regulated genes with these data?
rna-seq normalization fold-change
$endgroup$
add a comment |
$begingroup$
I have two bulk RNA-Seq samples, already tpm-normalized.
I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.
The distribution of the two samples using the common set of genes looks similar:
However, the two samples have only been tpm-normalized, is it enough to guarantee reliable fold-change estimation? Should I use another normalization procedure, e.g. Quantile Normalization, before comparison?
My objective is to define a signature using the genes that are up-regulated in Sample1 with respect to Sample0, and vice versa. I'm using log fold-changes, but I'm concerned that their value may be affected by each sample distribution.
Do you also have suggestions for the definition of up-regulated genes with these data?
rna-seq normalization fold-change
$endgroup$
add a comment |
$begingroup$
I have two bulk RNA-Seq samples, already tpm-normalized.
I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.
The distribution of the two samples using the common set of genes looks similar:
However, the two samples have only been tpm-normalized, is it enough to guarantee reliable fold-change estimation? Should I use another normalization procedure, e.g. Quantile Normalization, before comparison?
My objective is to define a signature using the genes that are up-regulated in Sample1 with respect to Sample0, and vice versa. I'm using log fold-changes, but I'm concerned that their value may be affected by each sample distribution.
Do you also have suggestions for the definition of up-regulated genes with these data?
rna-seq normalization fold-change
$endgroup$
I have two bulk RNA-Seq samples, already tpm-normalized.
I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.
The distribution of the two samples using the common set of genes looks similar:
However, the two samples have only been tpm-normalized, is it enough to guarantee reliable fold-change estimation? Should I use another normalization procedure, e.g. Quantile Normalization, before comparison?
My objective is to define a signature using the genes that are up-regulated in Sample1 with respect to Sample0, and vice versa. I'm using log fold-changes, but I'm concerned that their value may be affected by each sample distribution.
Do you also have suggestions for the definition of up-regulated genes with these data?
rna-seq normalization fold-change
rna-seq normalization fold-change
asked 10 hours ago
gc5gc5
721216
721216
add a comment |
add a comment |
3 Answers
3
active
oldest
votes
$begingroup$
It's not a good idea to do tpm normalisation prior to differential expression analysis, because the actual read counts are useful to determine shot noise and statistical significance. DESeq2 includes read normalisation as part of its methods for differential expression analysis.
$endgroup$
$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
8 hours ago
add a comment |
$begingroup$
What I have generally done in the past is to process the data using voom
in the limma
package for bulk RNASeq. Inside voom
you can call for different normalization methods to be used - "TMM" works fine for me and, is advocated by many in the field.
voom
will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.
Check out this link for more info on the package as well as normalization methods: https://www.bioconductor.org/packages/devel/workflows/vignettes/RNAseq123/inst/doc/limmaWorkflow.html
It is a very thorough introduction to the package and all of its capabilities.
Good luck!
$endgroup$
add a comment |
$begingroup$
You have only two samples?
You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.
$endgroup$
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "676"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fbioinformatics.stackexchange.com%2fquestions%2f7142%2fnormalization-for-two-bulk-rna-seq-samples-to-enable-reliable-fold-change-estima%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
It's not a good idea to do tpm normalisation prior to differential expression analysis, because the actual read counts are useful to determine shot noise and statistical significance. DESeq2 includes read normalisation as part of its methods for differential expression analysis.
$endgroup$
$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
8 hours ago
add a comment |
$begingroup$
It's not a good idea to do tpm normalisation prior to differential expression analysis, because the actual read counts are useful to determine shot noise and statistical significance. DESeq2 includes read normalisation as part of its methods for differential expression analysis.
$endgroup$
$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
8 hours ago
add a comment |
$begingroup$
It's not a good idea to do tpm normalisation prior to differential expression analysis, because the actual read counts are useful to determine shot noise and statistical significance. DESeq2 includes read normalisation as part of its methods for differential expression analysis.
$endgroup$
It's not a good idea to do tpm normalisation prior to differential expression analysis, because the actual read counts are useful to determine shot noise and statistical significance. DESeq2 includes read normalisation as part of its methods for differential expression analysis.
answered 9 hours ago
gringergringer
7,79221049
7,79221049
$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
8 hours ago
add a comment |
$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
8 hours ago
$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
8 hours ago
$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
8 hours ago
add a comment |
$begingroup$
What I have generally done in the past is to process the data using voom
in the limma
package for bulk RNASeq. Inside voom
you can call for different normalization methods to be used - "TMM" works fine for me and, is advocated by many in the field.
voom
will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.
Check out this link for more info on the package as well as normalization methods: https://www.bioconductor.org/packages/devel/workflows/vignettes/RNAseq123/inst/doc/limmaWorkflow.html
It is a very thorough introduction to the package and all of its capabilities.
Good luck!
$endgroup$
add a comment |
$begingroup$
What I have generally done in the past is to process the data using voom
in the limma
package for bulk RNASeq. Inside voom
you can call for different normalization methods to be used - "TMM" works fine for me and, is advocated by many in the field.
voom
will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.
Check out this link for more info on the package as well as normalization methods: https://www.bioconductor.org/packages/devel/workflows/vignettes/RNAseq123/inst/doc/limmaWorkflow.html
It is a very thorough introduction to the package and all of its capabilities.
Good luck!
$endgroup$
add a comment |
$begingroup$
What I have generally done in the past is to process the data using voom
in the limma
package for bulk RNASeq. Inside voom
you can call for different normalization methods to be used - "TMM" works fine for me and, is advocated by many in the field.
voom
will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.
Check out this link for more info on the package as well as normalization methods: https://www.bioconductor.org/packages/devel/workflows/vignettes/RNAseq123/inst/doc/limmaWorkflow.html
It is a very thorough introduction to the package and all of its capabilities.
Good luck!
$endgroup$
What I have generally done in the past is to process the data using voom
in the limma
package for bulk RNASeq. Inside voom
you can call for different normalization methods to be used - "TMM" works fine for me and, is advocated by many in the field.
voom
will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.
Check out this link for more info on the package as well as normalization methods: https://www.bioconductor.org/packages/devel/workflows/vignettes/RNAseq123/inst/doc/limmaWorkflow.html
It is a very thorough introduction to the package and all of its capabilities.
Good luck!
answered 9 hours ago
h3ab74h3ab74
836
836
add a comment |
add a comment |
$begingroup$
You have only two samples?
You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.
$endgroup$
add a comment |
$begingroup$
You have only two samples?
You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.
$endgroup$
add a comment |
$begingroup$
You have only two samples?
You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.
$endgroup$
You have only two samples?
You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.
answered 7 hours ago
swbarnes2swbarnes2
48114
48114
add a comment |
add a comment |
Thanks for contributing an answer to Bioinformatics Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fbioinformatics.stackexchange.com%2fquestions%2f7142%2fnormalization-for-two-bulk-rna-seq-samples-to-enable-reliable-fold-change-estima%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown